FOREWORD


This little book is devoted to three theorems in arithmetic, 
which, in spite of their apparent simplicity, have been the objects 
of the efforts of many important mathematical scholars. The proofs 
which are presented here make use of completely elementary means
(although they are not very simple).


The book can be understood by beginning college students, 
and is intended for wide circles of lovers of mathematics.


A LETTER TO THE FRONT
(IN LIEU OF A PREFACE) 


March 24, 1945 
Dear Seryozha, 


Your letter sent from the hospital gave me threefold pleasure. 
First of all, your request that I send you “some little mathematical
pearls” showed that you are really getting well, and are not
merely trying, as a brave fighter, to relieve your friends’ anxiety. 
That was my first pleasure. 


Furthermore, you gave me occasion to reflect on how it is
that in this war such young fighters as you happen to pursue their
favorite occupation (the occupation which they cherished already
before the war, and from which the war has torn them) so passionately
during every little respite. There was nothing like this
during the last World War. In those days a young man who had
arrived at the front almost invariably felt that his life had been
disrupted, that what had been the substance of his life before had
become for him an unrealizable legend. Now, however, there are 
some who write dissertations in the intervals between battles, and 
defend them on their return during a brief furlough! Is it not be- 
cause you feel with your whole being, that your accomplishments 
in war and in your favorite occupations—science, art, practical ac- 
tivity—are two links of one and the same great cause? And if so, is 
not this feeling, perhaps, one of the mainsprings of your victories 
which we, here at home, are so enthusiastic about? This thought 
gratified me very much, and that was my second pleasure. 


And so I began to think about what to send you. I do not know 
you very well—you attended my lectures for only one year. Never- 
theless I retained a firm conviction of your profound and serious 
attitude toward science, and therefore I did not want to send you 
merely some trinkets which were showy but of little substance sci- 
entifically. On the other hand I knew that your preparation was not 
very great—you spent only one year in the university classroom, 
and during three years of uninterrupted service at the Front you 
will hardly have had time to study. After several days’ deliberation
I have made a choice. You must judge for yourself whether it
is a happy one or not. Personally I consider the three theorems of 
arithmetic which I am sending you, to be genuine pearls of our 
science. 


From time to time, remarkable, curious problems turn up in
arithmetic, this oldest, but forever youthful, branch of mathematics.
In content they are so elementary that any schoolboy can under- 
stand them. They are usually concerned with the proof of some 
very simple law governing the world of numbers, a law which turns 
out to be correct in all tested special cases. The problem now is to 
prove that it is in fact always correct. And yet, in spite of the ap- 
parent simplicity of the problem, its solution resists, for years and 
sometimes centuries, the efforts of the most important scholars of 
the age. You must admit that this is extraordinarily tempting. 


I have selected three such problems for you. They have all 
been solved quite recently, and there are two remarkable common 
features in their history. First, all three problems have been solved 
by the most elementary arithmetical methods (do not, however, con- 
fuse elementary with simple; as you will see, the solutions of all 
three problems are not very simple, and it will require not a little 
effort on your part to understand them well and assimilate them). 
Secondly, all three problems have been solved by very young, be- 
ginning mathematicians, youths of hardly your age, after a series of 
unsuccessful attacks on the part of “venerable” scholars. Isn’t
this a spur full of promise for future scholars like you? What an 
encouraging call to scientific daring! 


The work of expounding these theorems compelled me to pene- 
trate more deeply into the structure of their magnificent proofs, and 
gave me great pleasure. 


That was my third pleasure. 


I wish you the best of success—in combat and in science. 


Yours, 


A. Khinchin 


CHAPTER I 


VAN DER WAERDEN’S THEOREM ON 
ARITHMETIC PROGRESSIONS 


§1 


In the summer of 1928 I spent several weeks in Göttingen. As
usual, many foreign scholars had arrived there for the summer semester.
I got to know many of them, and actually made friends with
some. At the time of my arrival, the topic of the day was the brilliant
result of a young Hollander, van der Waerden,* who at that time
was still a youthful beginner, but is now a well-known scholar.
This result had just been obtained here in Göttingen, and in fact,
only a few days before my arrival. Nearly all mathematicians whom
I met told me about it with enthusiasm.


The problem had the following history. One of the mathemati- 
cians there (I forget his name**) had come upon the following prob- 
lem in the course of his scientific work: Imagine the set of all nat- 
ural numbers to be divided in any manner whatsoever into two parts 
(e. g., into even and odd numbers, or into prime and composite num- 
bers, or in any other way). Can one then assert that arithmetic pro- 
gressions of arbitrary length can be found in at least one of these 
parts? (By the length of an arithmetic progression I mean here, and 
in what follows, simply the number of its terms.) All to whom this 
question was put regarded the problem at first sight as quite simple; 
its solution in the affirmative appeared to be almost self-evident. 
The first attempts to solve it, however, led to nought. And as the 
mathematicians of Göttingen and their foreign guests were by tradition
in constant association with one another, this problem, provoking
in its resistance, soon became the object of general mathematical
interest. Everyone took it up, from the venerable scholar to the
young student. After several weeks of strenuous exertions, the
problem finally yielded to the attack of a young man who had come
to Göttingen to study, the Hollander van der Waerden. I made his
acquaintance, and learned the solution of the problem from him personally.
It was elementary, but not simple by any means. The problem
turned out to be deep; the appearance of simplicity was deceptive.

*B. L. van der Waerden, Beweis einer Baudetschen Vermutung, Nieuw Arch.
Wiskunde 15, 212-216 (1927). (Trans.)
**Most probably Baudet; cf. the preceding footnote. (Trans.)

Quite recently, M. A. Lukomskaya (of Minsk) discovered and
sent me a considerably simpler and more transparent proof of van 
der Waerden’s theorem, which, with her kind permission, I am going 
to show you in what follows. 


§ 2 


Actually, van der Waerden proved somewhat more than what was
required. In the first place, he assumes that the natural numbers are
divided, not into two, but into arbitrarily many, say k, classes
(sets). In the second place, it turns out that it is not necessary
to decompose the entire sequence of natural numbers in order to
guarantee the existence of an arithmetic progression of prescribed
(arbitrarily large) length l in at least one of these classes; a certain
segment of it suffices for this purpose. The length, n(k, l),
of this segment is a function of the numbers k and l. Of course it
doesn’t matter where we take this segment, so long as there are
n(k, l) successive natural numbers.

Accordingly, van der Waerden’s theorem can be formulated as 
follows: 

Let k and l be two arbitrary natural numbers. Then there exists
a natural number n(k, l) such that, if an arbitrary segment, of length
n(k, l), of the sequence of natural numbers is divided in any manner
into k classes (some of which may be empty), then an arithmetic
progression of length l appears in at least one of these classes.

This theorem is true trivially for l = 2. To see this, it suffices
to set n(k, 2) = k+1; for if k+1 numbers are divided into k classes,
then certainly at least one of these classes contains more than one
number, and an arbitrary pair of numbers forms an arithmetic progression
of length 2, which proves the theorem. We shall prove the
theorem by induction on l. Consequently, we shall assume throughout
the following that the theorem has already been verified for
some number $l \ge 2$ and for arbitrary values of k, and shall show that
it retains its validity for the number l+1 (and naturally also for all
values of k).
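
For very small k and l the statement of the theorem can be checked by brute force. The Python sketch below is only an illustration and plays no part in the proof; the names has_mono_ap and smallest_n are hypothetical. It tries every division of a segment of length n into k classes and reports the least n for which each division contains a progression of the required length within one class. Any larger number also serves as n(k, l).

```python
from itertools import product

def has_mono_ap(coloring, length):
    """True if some class contains an arithmetic progression of `length` terms."""
    n = len(coloring)
    for start in range(n):
        for diff in range(1, n):
            last = start + (length - 1) * diff
            if last >= n:
                break  # larger differences only reach further out
            if all(coloring[start + j * diff] == coloring[start]
                   for j in range(length)):
                return True
    return False

def smallest_n(k, length, limit=30):
    """Least n <= limit such that every division of n numbers into k classes works."""
    for n in range(1, limit + 1):
        if all(has_mono_ap(c, length) for c in product(range(k), repeat=n)):
            return n
    return None  # nothing found below the (assumed) search limit

print(smallest_n(2, 2))  # 3, the pigeonhole value k + 1
print(smallest_n(3, 2))  # 4
print(smallest_n(2, 3))  # 9
```

The search is exponential in n, so it is practical only for the tiniest cases.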


§3 


According to our assumption, then, for every natural number k
there is a natural number n(k, l) such that, if an arbitrary segment,
of length n(k, l), of the natural numbers is divided in any manner
into k classes, there exists in at least one of these classes an
arithmetic progression of length l. We must then prove that, for
every natural number k, an n(k, l+1) also exists. We solve this
problem by actually constructing the number n(k, l+1). To this end
we set

$q_0 = 1, \quad n_0 = n(k, l),$

and then define the numbers $q_1, q_2, \ldots, n_1, n_2, \ldots$ successively as
follows: If $q_{s-1}$ and $n_{s-1}$ have already been defined for some $s > 0$,
we put

(1)  $q_s = 2 n_{s-1} q_{s-1}, \quad n_s = n(k^{q_s}, l) \quad (s = 1, 2, \ldots).$


The numbers $n_s$, $q_s$ are obviously defined hereby for an arbitrary
$s \ge 0$. We now assert that for n(k, l+1) we may take the number $q_k$.
We have to show then that if a segment, of length $q_k$, of the sequence
of natural numbers is divided in any manner into k classes,
then there is an arithmetic progression of length l+1 in at least one
of these classes. The remainder of the chapter is devoted to this
proof.

In the sequel we set l+1 = l' for brevity.
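
To get a feeling for how quickly the numbers $q_s$ grow, one can run recursion (1) once, starting from l = 2, where the text gives n(k, 2) = k+1. The sketch below does this in Python; the function name khinchin_bound and the parameter n_base are hypothetical, and the result is only the bound for n(k, 3) that this construction produces, not the smallest possible value.

```python
def khinchin_bound(k, n_base=lambda classes: classes + 1):
    """Return q_k from recursion (1) for l = 2.

    n_base(c) stands for n(c, 2); the default c + 1 is the value
    the text gives for progressions of length 2.
    """
    q = 1            # q_0 = 1
    n = n_base(k)    # n_0 = n(k, 2)
    for s in range(1, k + 1):
        q = 2 * n * q              # q_s = 2 n_{s-1} q_{s-1}
        if s < k:                  # n_k is never needed, so skip it
            n = n_base(k ** q)     # n_s = n(k^{q_s}, 2)
    return q

print(khinchin_bound(1))               # 4
print(khinchin_bound(2))               # 780
print(khinchin_bound(3).bit_length())  # size in bits of the bound for k = 3
```

For k = 2 the construction yields 780, although a segment of only 9 numbers already suffices; the proof aims at existence, not at economy.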


§ 4 


Suppose then that the segment $\Delta$, of length $q_k$, of the sequence
of natural numbers is divided in an arbitrary way into k classes. We
say that two numbers a and b of $\Delta$ are of the same type, if a and b
belong to the same class, and we then write $a \approx b$. Two equally long
subsegments of $\Delta$, $\delta = (a, a+1, \ldots, a+r)$ and $\delta' = (a', a'+1, \ldots, a'+r)$,
are said to be of the same type, if $a \approx a'$, $a+1 \approx a'+1$, ..., $a+r \approx a'+r$,
and we then write $\delta \approx \delta'$. The number of different possible types for
the numbers of the segment $\Delta$ is obviously equal to k. For segments
of the form (a, a+1) (i.e., for segments of length 2) the number of
possible types is $k^2$; and in general, for segments of length m, it is
$k^m$. (Of course not all these types need actually appear in the segment
$\Delta$.)
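
The notion of type is easy to make concrete. In the following Python sketch (an illustration only; the list classes is a made-up division of twelve numbers into k = 2 classes and segment_type is a hypothetical helper) the type of a subsegment is simply the tuple of class labels of its numbers, so two subsegments are of the same type exactly when their tuples coincide.

```python
def segment_type(classes, start, length):
    """Type of the subsegment beginning at index `start`: its tuple of class labels."""
    return tuple(classes[start:start + length])

# hypothetical division of a segment of 12 numbers into k = 2 classes
classes = [0, 1, 1, 0, 0, 1, 1, 0, 1, 1, 0, 0]
k, m = 2, 2

# all types of subsegments of length m that actually occur
types_seen = {segment_type(classes, a, m) for a in range(len(classes) - m + 1)}
print(len(types_seen), "<=", k ** m)

# two subsegments of length 3 that happen to be of the same type
print(segment_type(classes, 1, 3) == segment_type(classes, 5, 3))
```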

Since $q_k = 2 n_{k-1} q_{k-1}$ (see (1)), the segment $\Delta$ can be regarded as
a sequence of $2 n_{k-1}$ subsegments of length $q_{k-1}$. Such subsegments,
as we have just seen, can have $k^{q_{k-1}}$ different types. The left half
of the segment $\Delta$ now contains $n_{k-1}$ such subsegments, where
$n_{k-1} = n(k^{q_{k-1}}, l)$ according to (1). Because of the meaning of the
number $n(k^{q_{k-1}}, l)$, we can assert* that the left half of the segment
$\Delta$ contains an arithmetic progression of l of these subsegments of
the same type,

$\Delta_1, \Delta_2, \ldots, \Delta_l,$

of length $q_{k-1}$; here we say for brevity that equally long segments
$\Delta_i$ form an arithmetic progression, if their initial numbers form
such a progression. We call the difference between the initial numbers
of two neighboring segments of the progression $\Delta_1, \Delta_2, \ldots, \Delta_l$
the difference $d_1$ of this progression. Naturally the difference between
the second (or third, fourth, etc.) numbers of two such neighboring
segments is likewise equal to $d_1$.

*Work with the initial numbers of the $n_{k-1}$ subsegments. (Trans.)

To this progression of segments we now add the succeeding,
(l+1)-st, term $\Delta_{l'}$ (we recall that l' = l+1) which may already project
beyond the boundary of the left half of the segment $\Delta$, but
which in any case still belongs entirely to the segment $\Delta$. The segments
$\Delta_1, \Delta_2, \ldots, \Delta_l, \Delta_{l'}$ then form an arithmetic progression, of
length l' = l+1 and difference $d_1$, of segments of length $q_{k-1}$, where
$\Delta_1, \Delta_2, \ldots, \Delta_l$ are of the same type. We know nothing about the type
of the last segment $\Delta_{l'}$.

This completes the first step of our construction. It would be 
well if you thought it through once more before we continued. 


§5 


We now proceed to the second step. We take an arbitrary one of
the first l terms of the progression of segments just constructed.
Let this term be $\Delta_{i_1}$, so that $1 \le i_1 \le l$; $\Delta_{i_1}$ is a segment of length
$q_{k-1}$. We treat it the same way as we treated the segment $\Delta$. Since
$q_{k-1} = 2 n_{k-2} q_{k-2}$, the left half of the segment $\Delta_{i_1}$ can be regarded
as a sequence of $n_{k-2}$ subsegments of length $q_{k-2}$. For subsegments
of this length there are $k^{q_{k-2}}$ types possible, and on the other hand
$n_{k-2} = n(k^{q_{k-2}}, l)$ because of (1). Therefore the left half of $\Delta_{i_1}$ must
contain a progression of l of these subsegments of the same type,
$\Delta_{i_1 i_2}$ ($1 \le i_2 \le l$), of length $q_{k-2}$. Let $d_2$ be the difference of this
progression (i.e., the distance between the initial numbers of two
neighboring segments). To this progression of segments we add the
(l+1)-st term $\Delta_{i_1 l'}$, about whose type, of course, we know nothing.
The segment $\Delta_{i_1 l'}$ does not have to belong to the left half of the
segment $\Delta_{i_1}$ any more, but must obviously belong to the segment
$\Delta_{i_1}$.

We now carry over our construction, which we have executed
up to now in only one of the segments $\Delta_{i_1}$, congruently to all the
other segments $\Delta_{i_1}$ ($1 \le i_1 \le l'$). We thus obtain a set of segments
$\Delta_{i_1 i_2}$ ($1 \le i_1 \le l'$, $1 \le i_2 \le l'$) with two indices. It is clear that two
arbitrary segments of this set with indices not exceeding l are of the
same type:

$\Delta_{i_1 i_2} \approx \Delta_{i_1' i_2'} \quad (1 \le i_1, i_2, i_1', i_2' \le l).$


You no doubt see now that this process can be continued. We
carry it out k times. The results of our construction after the first
step were segments of length $q_{k-1}$, after the second step, segments
of length $q_{k-2}$, etc. After the k-th step, therefore, the results of the
construction are segments of length $q_0 = 1$, i.e., simply numbers of
our original segment $\Delta$. Nevertheless we denote them as before by

$\Delta_{i_1 i_2 \ldots i_k} \quad (1 \le i_1, i_2, \ldots, i_k \le l').$

For $1 \le s \le k$ and $1 \le i_1, \ldots, i_s, i_1', \ldots, i_s' \le l$ we have

(2)  $\Delta_{i_1 i_2 \ldots i_s} \approx \Delta_{i_1' i_2' \ldots i_s'}.$

We now make two remarks which are important for what follows.

1) In (2), if $s < k$ and if $i_{s+1}, i_{s+2}, \ldots, i_k$ are arbitrary indices
taken from the sequence 1, 2, ..., l, l', then the number
$\Delta_{i_1 \ldots i_s i_{s+1} \ldots i_k}$ appears in the same position in the segment
$\Delta_{i_1 \ldots i_s}$ as the number $\Delta_{i_1' \ldots i_s' i_{s+1} \ldots i_k}$ does in the segment
$\Delta_{i_1' \ldots i_s'}$. Since these two segments are of the same type because of
(2), it follows that

(3)  $\Delta_{i_1 \ldots i_s i_{s+1} \ldots i_k} \approx \Delta_{i_1' \ldots i_s' i_{s+1} \ldots i_k}$

if $1 \le i_1, \ldots, i_s, i_1', \ldots, i_s' \le l$ and $1 \le i_{s+1}, \ldots, i_k \le l'$ ($1 \le s \le k$).

2) For $s \le k$ and $i_s' = i_s + 1$, $\Delta_{i_1 \ldots i_{s-1} i_s}$ and $\Delta_{i_1 \ldots i_{s-1} i_s'}$ are
obviously neighboring segments in the s-th step of our construction.
Therefore for arbitrary indices $i_{s+1}, \ldots, i_k$ the numbers
$\Delta_{i_1 \ldots i_{s-1} i_s i_{s+1} \ldots i_k}$ and $\Delta_{i_1 \ldots i_{s-1} i_s' i_{s+1} \ldots i_k}$ appear in the same
position in two such neighboring segments, so that (with $i_s' = i_s + 1$)

(4)  $\Delta_{i_1 \ldots i_{s-1} i_s' i_{s+1} \ldots i_k} - \Delta_{i_1 \ldots i_{s-1} i_s i_{s+1} \ldots i_k} = d_s.$


§6 


Now we are near our goal. We consider the following k+1 numbers
of the segment $\Delta$:

     $a_0 = \Delta_{l' l' l' \ldots l'},$
     $a_1 = \Delta_{1 l' l' \ldots l'},$
(5)  $a_2 = \Delta_{1 1 l' \ldots l'},$
     $\ldots\ldots$
     $a_k = \Delta_{1 1 1 \ldots 1}.$

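Since the segment $\Delta$ has been divided into k classes, and we have
k+1 numbers in (5), there are two of these numbers which belong
to the same class. Let these be the numbers $a_r$ and $a_s$ ($r < s$), so
that

(6)  $\Delta_{\underbrace{1 \ldots 1}_{r} \underbrace{l' \ldots l'}_{k-r}} \approx \Delta_{\underbrace{1 \ldots 1}_{s} \underbrace{l' \ldots l'}_{k-s}}.$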

We consider the l+1 numbers

(7)  $c_i = \Delta_{\underbrace{1 \ldots 1}_{r} \underbrace{i \ldots i}_{s-r} \underbrace{l' \ldots l'}_{k-s}} \quad (1 \le i \le l').$

The first l numbers of this group (i.e., those with $i \le l$) belong to
the same class because of (3). The last (i = l'), however, is of the
same type as the first because of (6). Consequently all l+1 numbers
in (7) are of the same type, and to prove our assertion we have
only to show that these numbers form an arithmetic progression,
i.e., that the difference $c_{i+1} - c_i$ ($1 \le i \le l$) does not depend on i.


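We set i+1 = i' for brevity. Further let

$c_{i,m} = \Delta_{\underbrace{1 \ldots 1}_{r} \underbrace{i' \ldots i'}_{m} \underbrace{i \ldots i}_{s-r-m} \underbrace{l' \ldots l'}_{k-s}} \quad (0 \le m \le s-r),$

so that $c_{i,0} = c_i$, $c_{i,s-r} = c_{i+1}$, and hence

$c_{i+1} - c_i = \sum_{m=1}^{s-r} (c_{i,m} - c_{i,m-1}).$

Because of (4) we have

$c_{i,m} - c_{i,m-1} = \Delta_{\underbrace{1 \ldots 1}_{r} \underbrace{i' \ldots i'}_{m} \underbrace{i \ldots i}_{s-r-m} \underbrace{l' \ldots l'}_{k-s}} - \Delta_{\underbrace{1 \ldots 1}_{r} \underbrace{i' \ldots i'}_{m-1} \underbrace{i \ldots i}_{s-r-m+1} \underbrace{l' \ldots l'}_{k-s}} = d_{r+m}.$

Thus the difference is

$c_{i+1} - c_i = d_{r+1} + d_{r+2} + \ldots + d_s$

and is indeed independent of i, which completes the proof of our
assertion.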
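
The last step is pure bookkeeping, and it can be replayed numerically. By (4), raising the s-th index by one shifts a number by $d_s$, so the position of $\Delta_{i_1 \ldots i_k}$ depends on the indices in an additive way. The Python sketch below assumes exactly this additive model; the differences in diffs, the constant base and the helper names are hypothetical values chosen for illustration. It checks that the numbers $c_i$ of (7) are equally spaced with step $d_{r+1} + \ldots + d_s$.

```python
def position(indices, diffs, base=0):
    """Position of the number with the given indices, assuming (as (4) says)
    that raising the s-th index by one shifts the number by diffs[s - 1]."""
    return base + sum((i - 1) * d for i, d in zip(indices, diffs))

def c(i, r, s, k, l_prime, diffs):
    """The number c_i of (7): r ones, then s - r copies of i, then k - s copies of l'."""
    indices = [1] * r + [i] * (s - r) + [l_prime] * (k - s)
    return position(indices, diffs)

k, l = 4, 3
l_prime = l + 1
diffs = [100, 20, 5, 1]   # hypothetical d_1, ..., d_k
r, s = 1, 3               # hypothetical pair with a_r and a_s in the same class

terms = [c(i, r, s, k, l_prime, diffs) for i in range(1, l_prime + 1)]
steps = {b - a for a, b in zip(terms, terms[1:])}
print(terms)   # [3, 28, 53, 78]
print(steps)   # {25}, i.e. d_2 + d_3
```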


You see how complicated a completely elementary construc- 
tion can sometimes be. And yet this is not an extreme case: in the 
next chapter you will encounter just as elementary a construction 
which is considerably more complicated. Besides, it is not out of 
the question that van der Waerden’s theorem admits of an even 
simpler proof, and all research in this direction can only be welcomed.


CHAPTER II 


THE LANDAU-SCHNIRELMANN HYPOTHESIS 
AND MANN’S THEOREM 


§1 


You have perhaps heard of the remarkable theorem of 
Lagrange, that every natural number is the sum of at most 
four squares. In other words, every natural number is either 
itself the square of another number, or else the sum of two, 
or else of three, or else of four such squares. For the purpose 
at hand it is desirable to understand the content of this theorem 
in a somewhat different form. Let us write down the sequence of 
all perfect squares, beginning with zero: 


(S) 0, 1, 4, 9, 16, 25, ... . 

This is a certain sequence of whole numbers. We denote it by S,
and imagine four completely identical copies of it, $S_1, S_2, S_3, S_4$,
to be written down. Now we choose an arbitrary number $a_1$ from $S_1$,
an arbitrary number $a_2$ from $S_2$, an arbitrary number $a_3$ from $S_3$ and
an arbitrary number $a_4$ from $S_4$, and add these numbers together.
The resulting sum

(*)  $n = a_1 + a_2 + a_3 + a_4$

can be

1) zero (if $a_1 = a_2 = a_3 = a_4 = 0$);

2) the square of a natural number (if in some representation (*)
of the number n three of the numbers $a_1, a_2, a_3, a_4$ are zero and
the fourth is not zero);

3) the sum of two squares of natural numbers (if in some representation
(*) of the number n two of the numbers $a_1, a_2, a_3, a_4$
are zero and the other two are not zero);

4) the sum of three squares of natural numbers (if in some representation
(*) of the number n one of the numbers $a_1, a_2, a_3, a_4$
is equal to zero and the remaining three are not zero);

5) the sum of four squares of natural numbers (if in some representation
of the number n all four numbers are different from zero).

Thus the resulting number n is either zero or else a natural
number which can be represented as the sum of at most four
squares, and it is clear that conversely every natural number can be
obtained by the process which we have described.

Now let us arrange all natural numbers n which can be obtained
by means of our process (i.e., by the addition of four numbers taken
respectively from the sequences $S_1, S_2, S_3, S_4$), in order of magnitude,
in the sequence

(A)  $0, n_1, n_2, \ldots, n_m, \ldots$

(where $0 < n_1 < n_2 < n_3 < \ldots$, so that if there are equal numbers among
those constructed, only one of them appears in (A)). The theorem
of Lagrange now asserts simply that the sequence (A) contains all
the natural numbers, i.e., that $n_1 = 1$, $n_2 = 2$, $n_3 = 3$, etc.

We shall now generalize our process. Let there be given k
monotonically increasing sequences of integers which begin with
zero:

($A^{(1)}$)  $0, a_1^{(1)}, a_2^{(1)}, \ldots, a_m^{(1)}, \ldots$
($A^{(2)}$)  $0, a_1^{(2)}, a_2^{(2)}, \ldots, a_m^{(2)}, \ldots$
  $\ldots\ldots$
($A^{(k)}$)  $0, a_1^{(k)}, a_2^{(k)}, \ldots, a_m^{(k)}, \ldots$

We choose arbitrarily a single number from each sequence $A^{(i)}$
($1 \le i \le k$) and add these k numbers together. The totality of all numbers
constructed in this manner, if we order them according to magnitude,
yields a new sequence

(A)  $0, n_1, n_2, \ldots, n_m, \ldots$

of the same type, which we shall call the sum of the given sequences
$A^{(1)}, A^{(2)}, \ldots, A^{(k)}$:

$A = A^{(1)} + A^{(2)} + \ldots + A^{(k)} = \sum_{i=1}^{k} A^{(i)}.$

The content of Lagrange’s theorem is that the sum S+S+S+S contains
the entire sequence of natural numbers.

Perhaps you have heard of the famous theorem of Fermat, that 
the sum S+S contains all prime numbers which leave a remainder 
of 1 when divided by 4 (i.e., the numbers 5, 13, 17, 29, ...). Per- 
haps you also know that the famous Soviet scholar Ivan Matveye- 
vitch Vinogradov proved the following remarkable theorem, on which 
many of the greatest mathematicians of the preceding two centuries 
had worked without success: 

If we denote by P the sequence 


(P) 0, 2, 3, 5, 7, 11, 13, 17, ... 


consisting of zero and all prime numbers, then the sum P+P+P 
contains all sufficiently large odd numbers. 

I have cited all these examples here for only one very modest 
purpose: to familiarize you with the concept of the sum of se- 
quences of numbers and to show how some classical theorems of 
number theory can be formulated simply and conveniently with the 
aid of this concept. 
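
The sum of sequences is easy to experiment with on a finite range. The Python sketch below is an illustration under stated assumptions: sum_of_sequences and is_prime are hypothetical helper names, and LIMIT = 300 is an arbitrary cut-off, so the checks say nothing beyond that range. It forms S+S+S+S, S+S and P+P+P up to the limit and tests the three classical statements just quoted.

```python
def sum_of_sequences(sequences, limit):
    """All numbers <= limit obtainable by taking one term from each sequence and adding."""
    totals = {0}
    for seq in sequences:
        terms = [a for a in seq if a <= limit]
        totals = {t + a for t in totals for a in terms if t + a <= limit}
    return totals

def is_prime(n):
    return n >= 2 and all(n % d for d in range(2, int(n ** 0.5) + 1))

LIMIT = 300
S = [i * i for i in range(LIMIT + 1) if i * i <= LIMIT]     # 0, 1, 4, 9, ...
P = [0] + [p for p in range(2, LIMIT + 1) if is_prime(p)]   # 0, 2, 3, 5, ...

four_squares = sum_of_sequences([S, S, S, S], LIMIT)
three_squares = sum_of_sequences([S, S, S], LIMIT)
two_squares = sum_of_sequences([S, S], LIMIT)
three_primes = sum_of_sequences([P, P, P], LIMIT)

print(all(n in four_squares for n in range(1, LIMIT + 1)))     # Lagrange, up to LIMIT
print(7 in three_squares)                                      # three squares do not suffice
print(all(p in two_squares for p in P if p % 4 == 1))          # Fermat, up to LIMIT
print(all(n in three_primes for n in range(3, LIMIT + 1, 2)))  # odd numbers in this range
```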


§2 


As you have undoubtedly observed, in all the examples mention- 
ed we are concerned with showing that the sum of a certain number 
of sequences represents a sequence which contains either com- 
pletely or almost completely this or that class of numbers (e.g., 
all the natural numbers, all sufficiently large odd numbers, and 
others of the same sort). In all other similar problems the purpose 
of the investigation is to prove that the sum of the given sequences
of numbers represents a set of numbers which is in some
sense “dense” in the sequence of natural numbers. It is often the
case that this set contains the entire sequence of natural numbers
(as we saw in our first example). The theorem of Lagrange says that
the sum of the four sequences S contains the whole sequence of natural
numbers. Now it is customary to call the sequence A a basis
(of the sequence of natural numbers) of order k if the sum of k identical
sequences A contains all the natural numbers. The theorem of
Lagrange then states that the sequence S of perfect squares is a
basis of order four. It was shown later that the sequence of perfect
cubes forms a basis of order nine. A little reflection shows that
every basis of order k is also a basis of order k+1.

In all these and in many other examples the ‘‘density’’ of the 
sum which is to be established is determined by particular prop- 
erties of the sequences that are added together, i.e., by the spe- 
cial arithmetical nature of the numbers which go to make up these 
sequences (these numbers being either perfect squares, or primes, 
or others of a similar nature). Sixteen years ago the distinguished 
Soviet scholar Lev Genrichovitch Schnirelmann first raised the 
question: To what extent is the density of the sum of several sequences
determined solely by the density of the summands, irrespective
of their arithmetical nature? This problem turned out to be
not only deep and interesting, but also useful for the treatment of 
some classical problems. During the intervening fifteen years it 
received the attention of many outstanding scholars, and it has 
given rise to a rich literature. 

Before we can state problems in this field precisely and write
the word “density” without quotation marks, it is evident that we
must first agree on what number (or on what numbers) to use to
measure the “density” of our sequences with (just as in physics
the words “warm” and “cold” do not acquire a precise scientific
meaning until we have learned to measure temperature).

A very convenient measure of the “density” of a sequence of
numbers, which is now used for all scientific problems of the kind
we are considering, was proposed by L. G. Schnirelmann. Let

(A)  $0, a_1, a_2, \ldots, a_n, \ldots$

be a sequence of numbers, where, as usual, all the $a_n$ are natural
numbers and $a_n < a_{n+1}$ (n = 1, 2, ...). We denote by A(n) the number of
natural numbers in the sequence (A) which do not exceed n (zero is
not counted), so that $0 \le A(n) \le n$. Then the inequality

$0 \le \frac{A(n)}{n} \le 1$

holds. The fraction A(n)/n, which for different n naturally has different
values, can obviously be interpreted as a kind of average
density of the sequence (A) in the segment from 1 to n of the sequence
of natural numbers. Following the suggestion of Schnirelmann,
the greatest lower bound of all values of this fraction is
called the density of the sequence (A) (in the entire sequence of
natural numbers). We shall denote this density by d(A).

In order to become familiar at once with the elementary prop- 
erties of this concept, I recommend that you convince yourself of 
the validity of the following theorems: 

1. If $a_1 > 1$ (i.e., the sequence (A) does not contain unity), then
d(A) = 0.

2. If $a_n = 1 + r(n-1)$ (i.e., the sequence (A), beginning with $a_1$,
is an arithmetic progression with initial term 1 and difference r),
then d(A) = 1/r.

3. The density of every geometric progression is equal to zero.

4. The density of the sequence of perfect squares is equal to
zero.

5. For the sequence (A) to contain the entire sequence of natural
numbers ($a_n = n$, n = 1, 2, ...), it is necessary and sufficient that
d(A) = 1.

6. If d(A) = 0 and A contains the number 1, and if $\varepsilon > 0$ is arbitrary,
then there exists a sufficiently large number m such that
$A(m) < \varepsilon m$.
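
Before proving these statements it may help to look at numbers. The Python sketch below is an illustration only; truncated_density is a hypothetical name and LIMIT an arbitrary cut-off. It computes the smallest value of A(n)/n over a finite range. This is merely an upper bound for d(A), since the greatest lower bound runs over all n, but it already shows the behaviour described in statements 1, 2, 4 and 5.

```python
from fractions import Fraction

def truncated_density(seq, limit):
    """min of A(n)/n over 1 <= n <= limit; an upper bound for d(A), not d(A) itself."""
    members = set(seq)
    count, best = 0, Fraction(1)
    for n in range(1, limit + 1):
        if n in members:
            count += 1          # count is now A(n); zero is never counted
        best = min(best, Fraction(count, n))
    return best

LIMIT = 10_000
no_unity = [0] + list(range(2, LIMIT + 1))         # statement 1: unity is missing
progression = [0] + list(range(1, LIMIT + 1, 3))   # statement 2 with r = 3
squares = [0] + [i * i for i in range(1, 101)]     # statement 4
everything = list(range(LIMIT + 1))                # statement 5

print(truncated_density(no_unity, LIMIT))      # 0
print(truncated_density(progression, LIMIT))   # 1/3
print(truncated_density(squares, LIMIT))       # 1/101, and it keeps shrinking as LIMIT grows
print(truncated_density(everything, LIMIT))    # 1
```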

If you have proved all this, you are familiar enough with the
concept of density to be able to use it. Now I want to acquaint you
with the proof of the following remarkable, albeit very simple, lemma
of Schnirelmann:

(1)  $d(A + B) \ge d(A) + d(B) - d(A)\,d(B).$



The meaning of this inequality is clear: the density of the sum of
two arbitrary sequences of numbers is not smaller than the sum of
their densities diminished by the product of these densities. This
“Schnirelmann inequality” represents the first tool, still crude to
be sure, for estimating the density of a sum from the densities of
the summands. Here is its proof. We denote by A(n) the number of
natural numbers which appear in the sequence A and do not exceed
n, and by B(n) the analogous number for the sequence B. For brevity
we set $d(A) = \alpha$, $d(B) = \beta$, $A + B = C$, $d(C) = \gamma$. The segment (1, n) of
the sequence of natural numbers contains A(n) numbers of the sequence
A, each of which also appears in the sequence C. Let $a_k$
and $a_{k+1}$ be two consecutive numbers of this group. Between them
there are $a_{k+1} - a_k - 1 = l$ numbers which do not belong to A. These
are the numbers

$a_k + 1, a_k + 2, \ldots, a_k + l = a_{k+1} - 1.$



Some of them appear in C, e.g., all numbers of the form $a_k + r$,
where r occurs in B (which we abbreviate as follows: $r \in B$). There
are as many numbers of this last kind, however, as there are numbers
of B in the segment (1, l), that is, B(l) of them. Consequently
every segment of length l included between two consecutive numbers
of the sequence A contains at least B(l) numbers which belong
to C. It follows that the number, C(n), of numbers of the segment
(1, n) appearing in C is at least

$A(n) + \sum B(l),$

where the summation is extended over all segments which are free
of the numbers appearing in A. According to the definition of density,
however, $B(l) \ge \beta l$, so that

$C(n) \ge A(n) + \beta \sum l = A(n) + \beta \{n - A(n)\},$

because $\sum l$ is the sum of the lengths of all the segments which are
free of the numbers appearing in A, which is simply the number
n - A(n) of numbers of the segment (1, n) which do not occur in A.
But $A(n) \ge \alpha n$, and hence

$C(n) \ge A(n)(1 - \beta) + \beta n \ge \alpha n (1 - \beta) + \beta n,$

which yields

$C(n)/n \ge \alpha + \beta - \alpha\beta.$

Since this inequality holds for an arbitrary natural number n, we
have

$\gamma = d(C) \ge \alpha + \beta - \alpha\beta$, Q. E. D.
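
The counting step of this proof, that C(n) is at least A(n) plus the sum of B(l) over the segments free of A, can be tested directly on finite examples. The Python sketch below is an illustration only; the two sets and all function names are hypothetical, and both sets are assumed to contain zero, as every sequence in this chapter does.

```python
def count(members, n):
    """Number of elements a of the set with 1 <= a <= n."""
    return sum(1 for a in members if 1 <= a <= n)

def gap_lengths(a_set, n):
    """Lengths l of the maximal runs inside 1..n containing no element of A.
    Assumes 0 is in a_set, so the first run starts right after 0."""
    marks = sorted(a for a in a_set if a <= n) + [n + 1]
    return [b - a - 1 for a, b in zip(marks, marks[1:]) if b - a > 1]

def check_counting_bound(a_set, b_set, limit):
    """Check C(n) >= A(n) + sum of B(l) over the gaps, for every n up to limit."""
    c_set = {a + b for a in a_set for b in b_set}
    for n in range(1, limit + 1):
        bound = count(a_set, n) + sum(count(b_set, l) for l in gap_lengths(a_set, n))
        if count(c_set, n) < bound:
            return False
    return True

A = {0, 1, 4, 9, 16, 25, 36, 49}             # zero and the squares up to 49
B = {0, 1, 3, 6, 10, 15, 21, 28, 36, 45}     # zero and the triangular numbers up to 45
print(check_counting_bound(A, B, 50))        # True
```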


Schnirelmann’s inequality (1) can be written in the equivalent
form

$1 - d(A + B) \le \{1 - d(A)\}\{1 - d(B)\},$

and in this form can easily be generalized to the case of an arbitrary
number of summands:

$1 - d(A_1 + A_2 + \ldots + A_k) \le \prod_{i=1}^{k} \{1 - d(A_i)\}.$

It is proved by a simple induction; you should have no trouble in
carrying it out yourself. If we write the last inequality in the form

(2)  $d(A_1 + A_2 + \ldots + A_k) \ge 1 - \prod_{i=1}^{k} \{1 - d(A_i)\},$



it again enables one to estimate the density of a sum from the den- 
sities of the summands. L. G. Schnirelmann derived a series of very 
remarkable results from his elementary inequality, and obtained 
above all the following important theorem: 

Every sequence of positive density is a basis of the sequence 
of natural numbers. 

In other words, if $\alpha = d(A) > 0$, then the sum of a sufficiently
large number of sequences A contains the entire sequence of nat- 
ural numbers. The proof of this theorem is so simple that I should 
like to tell you about it, even though this will divert us a bit from 
our immediate problem. 

Let us denote for brevity by $A_k$ the sum of k sequences, each
of which coincides with A. Then by virtue of inequality (2),

$d(A_k) \ge 1 - (1 - \alpha)^k.$

Since $\alpha > 0$, we have, for sufficiently large k,

(3)  $d(A_k) > \tfrac{1}{2}.$

Now one can easily show that the sequence $A_{2k}$ contains the
whole sequence of natural numbers. This is a simple consequence 
of the following general proposition. 

LEMMA. If A(n)+B(n)>n-1, then n occurs in A+B. 


Indeed, if n appears in A or in B, everything is proved (both
sequences contain zero). We may
therefore assume that n occurs in neither A nor B. Then A(n) =
A(n-1) and B(n) = B(n-1), and consequently

$A(n-1) + B(n-1) > n - 1.$

Now let $a_1, a_2, \ldots, a_r$ and $b_1, b_2, \ldots, b_s$ be the numbers of the
segment (1, n-1) which appear in A and B, respectively, so that
r = A(n-1), s = B(n-1). Then all the numbers

$a_1, a_2, \ldots, a_r,$
$n - b_1, n - b_2, \ldots, n - b_s$

belong to the segment (1, n-1). There are r + s = A(n-1) + B(n-1) of
these numbers, which is more than n-1. Hence one of the numbers
in the upper row equals one of the numbers in the lower row. Let
$a_i = n - b_k$. Then $n = a_i + b_k$, i.e., n appears in A+B.
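
For small n the lemma can be confirmed by trying all possibilities. The Python sketch below is an illustration, not a proof; lemma_holds_up_to and subsets are hypothetical names. It runs through every pair of sets A and B inside the segment (1, n), each enlarged by zero, and checks that n is a sum a + b whenever A(n) + B(n) > n - 1.

```python
from itertools import combinations

def subsets(universe):
    """All subsets of the given range, as tuples."""
    for size in range(len(universe) + 1):
        yield from combinations(universe, size)

def lemma_holds_up_to(max_n):
    """Check: if A(n) + B(n) > n - 1 then n = a + b, for all A, B containing 0."""
    for n in range(1, max_n + 1):
        universe = range(1, n + 1)
        for a_part in subsets(universe):
            a_set = {0, *a_part}
            for b_part in subsets(universe):
                # len(a_part) is A(n) and len(b_part) is B(n)
                if len(a_part) + len(b_part) > n - 1:
                    b_set = {0, *b_part}
                    if not any(n - a in b_set for a in a_set):
                        return False
    return True

print(lemma_holds_up_to(6))   # True
```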

Returning now to our objective, we have, on the basis of (3),
for an arbitrary n:

$A_k(n) > \tfrac{1}{2} n > \tfrac{1}{2}(n - 1)$

and therefore

$A_k(n) + A_k(n) > n - 1.$

According to the lemma just proved, it follows that n appears in
$A_k + A_k = A_{2k}$. But n is an arbitrary natural number, and hence our
theorem is proved.
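
The proof also tells us how many summands are certainly enough: take the least k with $1 - (1 - \alpha)^k > \tfrac{1}{2}$ and use 2k copies of A. The Python sketch below computes this number; basis_order_bound is a hypothetical name, and the value is only the bound delivered by this particular argument, which may be much larger than the true order of the basis.

```python
from fractions import Fraction

def basis_order_bound(alpha):
    """Return 2k, where k is the least number with 1 - (1 - alpha)**k > 1/2."""
    alpha = Fraction(alpha)
    if not 0 < alpha <= 1:
        raise ValueError("the density must satisfy 0 < alpha <= 1")
    k, deficit = 1, 1 - alpha            # deficit = (1 - alpha)**k
    while deficit >= Fraction(1, 2):
        k += 1
        deficit *= 1 - alpha
    return 2 * k

print(basis_order_bound(Fraction(1, 10)))   # 14
print(basis_order_bound(Fraction(1, 2)))    # 4
print(basis_order_bound(1))                 # 2
```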

This simple theorem led to a series of important applications
in the papers of L. G. Schnirelmann. For example, he was the first
to prove that the sequence P consisting of unity and all the prime
numbers is a basis of the sequence of natural numbers. The sequence
P, it is true, has density zero, as Euler had already shown,
so that the theorem which we just proved is not directly applicable
to it. But Schnirelmann was able to prove that P+P has positive
density. Hence P+P forms a basis, and therefore P indeed also.
From this it is easy to infer that an arbitrary natural number, with
the exception of 1, can, for sufficiently large k, be represented as
the sum of at most k primes. For that time (1930) this result was
fundamental and evoked the greatest interest in the scientific
world. At present, thanks to the remarkable work of I. M. Vinogradov,
we know considerably more in this direction, as I already related
to you at the beginning of this chapter.


§3 

In the preceding it was my purpose to introduce you in the 
shortest way possible to the problems of this singular and fasci- 
nating branch of number theory, whose study began with L. G. 
Schnirelmann’s remarkable work. The immediate goal of the present 
chapter, however, is a specific problem in this field, and I now pro- 
ceed to its formulation. 

In the fall of 1931, upon his return from a foreign tour, L. G. 
Schnirelmann reported to us his conversations with Landau in 
Göttingen, and related among other things that in the course of
these conversations they had discovered the following interesting 
fact: In all the concrete examples that they were able to devise, it 
was possible to replace the inequality 


d(A+B) ≥ d(A)+d(B)-d(A)d(B),
which we derived in §2, by the sharper (and simpler) inequality
(4) d(A+B) ≥ d(A)+d(B).


That is, the density of the sum always turned out to be at least as 
large as the sum of the densities of the summands (under the assumption,
of course, that d(A)+d(B) ≤ 1). They therefore naturally
assumed that inequality (4) was the expression of a universal law, 
but the first attempts to prove this conjecture were unsuccessful. 
It soon became evident that if their conjecture was correct, the road 
to its proof would be quite difficult. We wish to note at this point 
that if the hypothetical inequality (4) does represent a universal 
law, then this law can be generalized immediately by induction to 
the case of an arbitrary number of summands; i.e., under the
assumption that


Σ_{i=1}^{k} d(A_i) ≤ 1

we have


(5) d(Σ_{i=1}^{k} A_i) ≥ Σ_{i=1}^{k} d(A_i).

This problem could not help but attract the attention of scholars,
because of the simplicity and elegance of the general hypothetical 
law (4) on the one hand, and on the other because of the sharp con- 
trast between the elementary character of the problem and the dif- 
ficulty of its solution which became apparent already after the first 
attacks. I myself was fascinated by it at the time, and neglected 
all my other researches on its account. Early in 1932, after several 
months of hard work, I succeeded in proving inequality (4) for the 
most important special case, d(A)=d(B) (this case must be consid- 
ered as the most important because in the majority of concrete 
problems all the summands are the same). At the same time I also 
proved the general inequality (5) under the assumption that d(A_1) =
d(A_2) = ... = d(A_k) (it is easy to see that this result cannot be derived
from the preceding one simply by induction, but requires a special
proof). The method which I used was completely elementary,
but very complicated. I was later able to simplify the proof some- 
what. 

Be that as it may, it was but a special case. For a long time 
it seemed to me that a none too subtle improvement of my method 
should lead to a full solution of the problem, but all my efforts in 
this direction proved fruitless. 

In the meantime the publication of my work had attracted the 
attention of a wide circle of scholars in all countries to the Landau- 
Schnirelmann hypothesis. Many insignificant results were obtained, 
and a whole literature sprang up. Some authors carried over the 
problem from the domain of natural numbers to other fields. In short, 
the problem became ‘‘fashionable’’. Learned societies offered 
prizes for its solution. My friends in England wrote me in 1935 
that a good half of the English mathematicians had postponed 
their usual work in order to try to solve this problem. Landau, in 
his tract devoted to the latest advances in additive number theory, 
wrote that he “‘should like to urge this problem on the reader’’. But 
it proved to be obstinate, and withstood the efforts of the most able 
scholars for a whole series of years. It was not until 1942 that the
young American mathematician Mann finally disposed of it: he found
a complete proof of inequality (4) (and hence also of inequality
(5)). His method is wholly elementary and is related to my work in
form, although it is based on an entirely different idea. The proof is
long and very complicated, and I could not bring myself to present
it to you here. A year later, however, in 1943, Artin and Scherk published
a new proof of the same theorem, which rests on an altogether
different idea. It is considerably shorter and more transparent,
though still quite elementary. This is the proof that I should like to
tell you about; I have written this chapter on its account, and it
forms the content of all the succeeding sections.


§4 
Suppose then that A and B are two sequences. We set A+B = C.
Let A(n), d(A), etc. have their usual meaning. We recall that all our
sequences begin with zero, but that only the natural numbers appearing
in these sequences are considered when calculating A(n),
B(n), C(n). We have to prove that

(6) d(C) ≥ d(A)+d(B)

provided that d(A)+d(B) ≤ 1. For brevity we set d(A) = α, d(B) = β in
what follows.

FUNDAMENTAL LEMMA. If n is an arbitrary natural number,
there exists an integer m (1 ≤ m ≤ n) such that


C(n) - C(n-m) ≥ (α+β)m.


In other words, there exists a “remainder” (n-m+1, n) of the
segment (1, n), in which the average density of the sequence C is at
least α+β.

We are now faced with two problems: first, to prove the fundamental
lemma, and second, to show that inequality (6) follows from
the fundamental lemma. The second of these problems is incomparably
easier than the first, and we shall therefore begin with the
second problem.

Suppose then that the fundamental lemma has already been
proved. This means that in a certain “remainder” (n-m+1, n) of
the segment (1, n) the average density of the sequence C is at least
α+β. By the fundamental lemma, however, the segment (1, n-m)
again has a certain “remainder” (n-m-m′+1, n-m) in which the
average density of C is at least α+β. It is clear that by continuing
this process, the segment (1, n) is eventually divided into a finite
number of subsegments, in each of which the average density of C
is at least α+β. Therefore the average density of C is also at
least α+β in the whole segment (1, n). Since n was arbitrary, however,
we have


d(C) ≥ α+β, Q.E.D.
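The peeling-off of “remainders” just described can be sketched in Python. This is an illustration under stated assumptions, not part of the original argument: the sequences are finite sets, the density is replaced by a truncated density (the minimum of A(k)/k over 1 ≤ k ≤ n only), and the names `density` and `remainder_lengths` are hypothetical. The search for m succeeds only because the fundamental lemma holds; the code does not prove it.

```python
from fractions import Fraction

def count(seq, n):
    """A(n): the number of elements of seq in the segment (1, n)."""
    return sum(1 for x in seq if 1 <= x <= n)

def density(seq, n):
    """Truncated density: the minimum of seq(k)/k over 1 <= k <= n."""
    return min(Fraction(count(seq, k), k) for k in range(1, n + 1))

def sumset(a, b, n):
    """Elements of A+B not exceeding n; 0 is adjoined to both summands."""
    a0, b0 = set(a) | {0}, set(b) | {0}
    return {x + y for x in a0 for y in b0 if x + y <= n}

def remainder_lengths(a, b, n):
    """Split (1, n) into remainders on which C has average density >= alpha+beta."""
    c = sumset(a, b, n)
    target = density(a, n) + density(b, n)
    if target > 1:
        raise ValueError("the lemma assumes d(A) + d(B) <= 1")
    lengths = []
    while n > 0:
        # shortest remainder (n-m+1, n) whose average density reaches the target
        m = next(m for m in range(1, n + 1)
                 if count(c, n) - count(c, n - m) >= target * m)
        lengths.append(m)
        n -= m
    return lengths
```

For A = B = {1, 4, 7} and n = 9 both truncated densities equal 1/3, and the segment (1, 9) splits into three remainders of length 3.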


Thus the problem is now reduced to proving the fundamental 
lemma. We now turn to this proof, which is long and complicated. 


§5 
NORMAL SEQUENCES 


In all that follows we shall regard the number n as fixed, and
all sequences which we investigate will consist of the number 0
and certain numbers of the segment (1, n). We agree to call such a
sequence N normal, if it possesses the following property: If the
arbitrary numbers f and f′ of the segment (1, n) do not appear in N,
then neither does the number f+f′-n appear in N (where the case
f = f′ is not excluded).
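The definition of normality translates directly into a test. The sketch below is illustrative; the function name `is_normal` is an assumption, and the sequence is represented as a finite set of numbers from the segment (1, n), with 0 adjoined as the text requires.

```python
def is_normal(seq, n):
    """True if f, f' in (1, n) both missing from N imply f + f' - n is missing too."""
    members = set(seq) | {0}          # every sequence contains 0
    missing = [f for f in range(1, n + 1) if f not in members]
    # f + f' - n may be negative (never a member) or 0 (always a member)
    return all((f + g - n) not in members for f in missing for g in missing)
```

For n = 5 the sequence {0, 1, 2, 3, 4} is normal, while {0, 1, 3} is not: the numbers 2 and 4 are both missing, yet 2+4-5 = 1 appears in the sequence.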

If the number n belongs to the sequence C, then


C(n) - C(n-1) = 1 ≥ (α+β)·1,


so that the fundamental lemma is trivially correct (m = 1). Consequently
we shall assume in the sequel (I beg you to keep this in
mind) that n does not occur in C.

To begin with, the fundamental lemma is easy to prove in case
the sequence C is normal. Indeed, let us denote by m the smallest
positive number which does not appear in C (m ≤ n because n, by
assumption, does not occur in C). Let s be an arbitrary integer lying
between n-m and n: n-m < s < n. Then 0 < s+m-n < m. I say that
s ∈ C. For if this were not the case, then neither s nor m would
appear in C, and the number s+m-n, because of the normality of C,
would not appear in C either. But we have just seen
that this number is smaller than m, whereas m, by definition, is the
smallest positive integer which does not occur in C.

Hence all numbers s of the segment n-m < s < n appear in C,
and therefore


C(n) - C(n-m) = m-1.

On the other hand, by the lemma on p. 24, since m does not occur
in C = A+B we have A(m)+B(m) ≤ m-1. Consequently


(7) C(n) - C(n-m) ≥ A(m)+B(m) ≥ (α+β)m,
which again proves the validity of the fundamental lemma.


§6


CANONICAL EXTENSIONS 


We now turn our attention to the case where the sequence
C = A+B is not normal. In this case we shall add to the set B, according
to a very definite scheme, numbers which it does not contain,
and thereby pass from B to an extended set B_1. The set
A+B_1 = C_1 evidently will then be a certain extension of the set C.
As I said before, this extension of the sets B and C (the set A remains
unaltered) will be defined precisely and unambiguously; it is
possible if and only if the set C is not normal. We shall call this
extension a canonical extension of the sets B and C. Some important
properties of canonical extensions will be derived, with whose
aid the proof of the fundamental lemma will be completed.

I now come to the definition of the canonical extension of the
sets B and C. If C is not normal, there exist two numbers c and c′
in the segment (0, n), such that


c ∉ C, c′ ∉ C, c+c′-n ∈ C.
Since C = A+B, it follows that
(8) c+c′-n = a+b (a ∈ A, b ∈ B).
Let β_0 be the smallest number of the set B which can play the role
of the number b in equation (8). In other words, β_0 is the smallest
integer b ∈ B which satisfies equation (8) for suitably chosen numbers
c ∉ C, c′ ∉ C, a ∈ A of the segment (0, n). This number β_0 will be called
the basis of our extension.
Thus the equation


(9) c+c′-n = a+β_0


necessarily has solutions in the numbers c, c′, a satisfying the
conditions
c ∉ C, c′ ∉ C, a ∈ A,
where all three numbers belong to the segment (0, n). We write all
numbers c and c′ which satisfy equation (9) and the enumerated conditions,
to form a set C*. Clearly the sets C and C* do not have a
single element in common. We call their union* (i.e., the totality
of all numbers which occur either in C or in C*)


C ∪ C* = C_1


the canonical extension of the set C.

Let us now examine the expression β_0+n-c. If c here is allowed
to run through all the numbers of the set C* just constructed,
the values of this expression form a certain set B*. According to
equation (9), every such number β_0+n-c (c ∈ C*) can be written in
the form c′-a, where c′ ∈ C*, a ∈ A.

Let b* be an arbitrary number occurring in B*. Since it is of
the form β_0+n-c, it is ≥ β_0 ≥ 0; and since it is also of the form
c′-a (c′ ∈ C*, a ∈ A), it is ≤ c′ ≤ n. Hence all numbers of the set B*
belong to the segment (0, n). Moreover, if b* ∈ B*, then b* ∉ B, because
otherwise it would follow from b* = c′-a that c′ = a+b* ∈ A+B
= C, which is false.

Accordingly, the set B* is embedded in the segment (0, n) and
has no elements in common with the set B. We put


B ∪ B* = B_1
and call the set B_1 a canonical extension of the set B.


Let us show that
A+B_1 = C_1.


First, let a ∈ A, b_1 ∈ B_1. We shall prove that a+b_1 ∈ C_1. From
b_1 ∈ B_1 it follows that either b_1 ∈ B or b_1 ∈ B*. If b_1 ∈ B, then a+b_1
∈ A+B = C ⊂ C_1. If b_1 ∈ B*, however, then a+b_1 either occurs in C,
and hence also in C_1, or a+b_1 ∉ C. But in this case (since b_1, as an
element of the set B*, is of the form β_0+n-c′, c′ ∈ C*) we obtain


c = a+b_1 = a+β_0+n-c′ ∉ C.
Therefore


c+c′-n = a+β_0 ∈ A+B = C,


where c ∉ C and c′ ∉ C. But then according to the definition of the
set C*,


c = a+b_1 ∈ C* ⊂ C_1, Q.E.D.


*Here and in the sequel we use the symbol ∪ to denote the union of sets, since we
are using the symbol + in another sense.


Thus we have shown that A+B_1 ⊂ C_1.

To prove the inverse relation, let us assume that c ∈ C_1, which
means that either c ∈ C or c ∈ C*. If c ∈ C, then c = a+b, a ∈ A, b ∈ B
⊂ B_1. If, however, c ∈ C*, then, for a certain a ∈ A, the number b* =
c-a, as we know, occurs in B*. We have c = a+b* ∈ A+B* ⊂ A+B_1.
Therefore C_1 ⊂ A+B_1. We also proved above that A+B_1 ⊂ C_1. Consequently
C_1 = A+B_1.

Now recall that according to our assumption, n ∉ C. It is easy to
see (and this is important) that the number n does not appear in the
extension C_1. For if we had n ∈ C*, we could, by the definition of
C*, put c′ = n in equation (9), which would yield c = a+β_0 ∈ A+B = C,
whereas c ∉ C according to (9).

If the extended sequence C_1 is not yet normal, then, because of
A+B_1 = C_1 and n ∉ C_1, the sets A, B_1, and C_1 form a triple with all
the properties of the triple A, B, C that are necessary for a new
canonical extension. We take a new basis β_1 of this extension, define
the complementary sets B*_1, C*_1 as before, put


B_1 ∪ B*_1 = B_2, C_1 ∪ C*_1 = C_2,


and are able to assert once more that A+B_2 = C_2 and n ∉ C_2. It is
evident that this process can be continued until one of the extensions
C_h proves to be normal. Obviously this case must certainly
take place, because in every extension we add new numbers to the
sets B_μ and C_μ without overstepping the bounds of the segment
(0, n).


In this way we obtain the finite sequences of sets


B = B_0 ⊂ B_1 ⊂ ... ⊂ B_h,
C = C_0 ⊂ C_1 ⊂ ... ⊂ C_h,


where every B_{μ+1} (respectively C_{μ+1}) contains numbers which do
not appear in B_μ (C_μ) and which go to make up the set B*_μ (C*_μ); so
that
B_{μ+1} = B_μ ∪ B*_μ, C_{μ+1} = C_μ ∪ C*_μ (0 ≤ μ ≤ h-1).


We denote by β_μ the basis of the extension which carries (B_μ, C_μ)
into (B_{μ+1}, C_{μ+1}). We have


A+B_μ = C_μ, n ∉ C_μ (0 ≤ μ ≤ h).


Finally, the set C_h is normal, whereas the sets C_μ (0 ≤ μ ≤ h-1) are
not.
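The whole construction of this section can be followed step by step on a small example. The Python sketch below is an illustration under assumptions: sets are finite subsets of the segment (0, n), sums exceeding n are discarded, and the names `canonical_step` and `extend_until_normal` are hypothetical. It requires Python 3.8 or later.

```python
def sumset(a, b, n):
    """Elements of A+B that do not exceed n."""
    return {x + y for x in a for y in b if x + y <= n}

def canonical_step(a, b, c, n):
    """One canonical extension of (B, C); returns None when C is normal."""
    missing = [x for x in range(1, n + 1) if x not in c]
    pairs = [(u, w) for u in missing for w in missing]
    # every b in B that can occur in c + c' - n = a + b with c, c' missing from C
    bases = [u + w - n - x for (u, w) in pairs for x in a
             if (u + w - n - x) in b]
    if not bases:
        return None
    beta = min(bases)                                  # the basis of the extension
    c_star = {u for (u, w) in pairs if (u + w - n - beta) in a}
    b_star = {beta + n - u for u in c_star}
    return beta, b_star, c_star

def extend_until_normal(a, b, n):
    """Repeat canonical extensions until C_h is normal; returns (B_h, C_h, bases)."""
    a, b = set(a) | {0}, set(b) | {0}
    c = sumset(a, b, n)
    if n in c:
        raise ValueError("the construction assumes that n does not occur in C")
    bases = []
    while (step := canonical_step(a, b, c, n)) is not None:
        beta, b_star, c_star = step
        b, c = b | b_star, c | c_star
        # the properties proved in the text: A + B_mu = C_mu and n not in C_mu
        assert c == sumset(a, b, n) and n not in c
        bases.append(beta)
    return b, c, bases
```

For A = {0, 1}, B = {0, 2} and n = 8 the process takes three extensions, with bases 0, 3, 5, and ends with C_3 = {0, 1, ..., 7}.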
§7
PROPERTIES OF THE CANONICAL EXTENSIONS 

We shall now formulate and prove in the form of three lemmas 
those properties of the canonical extensions which are needed lat- 
er. Only Lemma 3 will have further application; Lemmas 1 and 2 are 
required solely for the proof of Lemma 3. 

LEMMA 1. β_μ > β_{μ-1} (1 ≤ μ ≤ h-1); i.e., the bases of successive
canonical extensions form a monotonically increasing sequence.

In fact, since β_μ ∈ B_μ = B_{μ-1} ∪ B*_{μ-1}, either β_μ ∈ B*_{μ-1} or
β_μ ∈ B_{μ-1}. If β_μ ∈ B*_{μ-1}, then β_μ is of the form


β_μ = β_{μ-1}+n-c,


where c ∈ C*_{μ-1} ⊂ C_μ and therefore c < n, so that β_μ > β_{μ-1} and Lemma
1 is proved. If β_μ ∈ B_{μ-1}, however, then by the definition of the number
β_μ, there exist integers a ∈ A, c ∉ C_μ, c′ ∉ C_μ such that


c+c′-n = a+β_μ ∈ C_μ.
But for β_μ ∈ B_{μ-1}, we have


(10) c+c′-n = a+β_μ ∈ A+B_{μ-1} = C_{μ-1},


where c ∉ C_{μ-1}, c′ ∉ C_{μ-1}. Hence, because of the minimal property
of β_{μ-1}, β_μ ≥ β_{μ-1}. If β_μ = β_{μ-1}, it would follow from (10) and the
definition of the set C*_{μ-1} that


c ∈ C*_{μ-1} ⊂ C_μ, c′ ∈ C*_{μ-1} ⊂ C_μ.


Both are false, however, and therefore β_μ > β_{μ-1}.
In the sequel we shall denote by m the smallest positive integer
which does not appear in C_h.


LEMMA 2. If c ∈ C*_μ (0 ≤ μ ≤ h-1) and n-m < c < n, then c > n-m+β_μ.
That is, all numbers c of the set C*_μ which lie in the interval
n-m < c < n are embedded in that part of this segment which is characterized
by the inequalities n-m+β_μ < c < n.

We have to show that


c+m-n > β_μ.
It follows from n-m < c < n that
0 < m+c-n < m.
Therefore, by the definition of the number m,
m+c-n ∈ C_h.


Now
C_h = C_μ ∪ C*_μ ∪ C*_{μ+1} ∪ ... ∪ C*_{h-1}.


We consider two cases.
1) If m+c-n ∈ C_μ, then
m+c-n = a+b_μ, a ∈ A, b_μ ∈ B_μ.
But m ∉ C_μ, and c ∉ C_μ (the latter because c ∈ C*_μ). Therefore because
of the minimal property of β_μ we must have b_μ ≥ β_μ. If b_μ = β_μ it
would follow from the definition of the set C*_μ that m ∈ C*_μ, which is
false because C*_μ ⊂ C_{μ+1} ⊂ C_h and m ∉ C_h. Consequently b_μ > β_μ, so
that


m+c-n = a+b_μ ≥ b_μ > β_μ,


and Lemma 2 is proved.


2) If c′ = m+c-n ∈ C*_ν (μ ≤ ν ≤ h-1), then, by the definition of the
set C*_ν, c′ satisfies an equation of the form (9),


c′-a = β_ν+n-c″,


where a ∈ A, c″ ∈ C*_ν. Hence c′ ≥ c′-a > β_ν ≥ β_μ (where the last inequality
is given by Lemma 1), and Lemma 2 is again proved.

LEMMA 3. We have
C*_μ(n) - C*_μ(n-m) = B*_μ(m-1) (0 ≤ μ ≤ h-1).


That is, the number of integers c ∈ C*_μ in the segment n-m < c < n is
exactly the same as the number of integers b ∈ B*_μ in the segment
0 < b < m (of the same length).


Let us examine the relation
(11) b = β_μ+n-c.


By the very definition of the sets B*_μ and C*_μ, c ∈ C*_μ implies b ∈ B*_μ,
and conversely. If, in addition, n-m+β_μ < c < n, then β_μ < b < m, and
conversely. Hence


C*_μ(n) - C*_μ(n-m+β_μ) = B*_μ(m-1) - B*_μ(β_μ).


By Lemma 2, C*_μ(n-m+β_μ) = C*_μ(n-m). On the other hand, every
b ∈ B*_μ can be expressed in the form (11), where c < n; b therefore exceeds
β_μ, and consequently B*_μ(β_μ) = 0. It follows that


C*_μ(n) - C*_μ(n-m) = B*_μ(m-1), Q.E.D.


§8 


PROOF OF THE FUNDAMENTAL LEMMA 


It is very easy now to prove the fundamental lemma by proceed- 
ing from the results in §5 and appealing to Lemma 3 which was just 
proved. 

If we apply the result of §5 in the form of inequality (7) to the
sequences A, B_h, and C_h (which is permissible because of the normality
of C_h), we find that


(12) C_h(n) - C_h(n-m) ≥ A(m)+B_h(m),


where m is the smallest positive integer which does not occur in
C_h. Obviously m ∉ A and m ∉ B_h, so that we may write A(m-1) and
B_h(m-1) instead of A(m) and B_h(m), respectively.
We have
C_h = C ∪ C* ∪ C*_1 ∪ ... ∪ C*_{h-1},


B_h = B ∪ B* ∪ B*_1 ∪ ... ∪ B*_{h-1},
where the sets appearing in any one of these two unions are mutually
exclusive, so that

C_h(n) - C_h(n-m) = C(n) - C(n-m) + Σ_{μ=0}^{h-1} {C*_μ(n) - C*_μ(n-m)},

B_h(m) = B_h(m-1) = B(m-1) + Σ_{μ=0}^{h-1} B*_μ(m-1);

we have of course put C*_0 = C*, B*_0 = B*. On account of (12) it follows
that

C(n) - C(n-m) + Σ_{μ=0}^{h-1} {C*_μ(n) - C*_μ(n-m)}
    ≥ A(m) + B(m-1) + Σ_{μ=0}^{h-1} B*_μ(m-1).

By Lemma 3, however,
C*_μ(n) - C*_μ(n-m) = B*_μ(m-1) (0 ≤ μ ≤ h-1),


so that the preceding inequality becomes


C(n) - C(n-m) ≥ A(m)+B(m-1) = A(m)+B(m) ≥ (α+β)m,


which proves the fundamental lemma.

As we saw in § 4, this also completes the proof of Mann’s theo- 
rem which solves the fundamental metric problem of additive num- 
ber theory. 


Doesn’t Artin and Scherk’s construction have the stamp of a 
magnificent masterpiece? I find the outstanding combination of
structural finesse and the extremely elementary form of the method 
especially attractive. 


CHAPTER III


AN ELEMENTARY SOLUTION OF 
WARING’S PROBLEM 


§1 


You will recall the theorem of Lagrange, which was discussed 
at the beginning of the preceding chapter. It says that every natural 
number can be expressed as the sum of at most four squares. I also
showed you that this theorem could be stated in entirely different 
terms: If four sequences, each identical with 


(A_2) 0, 1^2, 2^2, ..., k^2, ...,


are added together, the resulting sequence contains all the natural 
numbers. Or even more briefly, the sequence (A_2) is a basis (of the
sequence of natural numbers) of order four. I also mentioned that, as 
had been shown later, the sequence of cubes 


(A_3) 0, 1^3, 2^3, ..., k^3, ...,


was a basis of order nine. All these facts lead in a natural manner 
to the hypothesis that, for an arbitrary natural number n, the sequence


(A_n) 0, 1^n, 2^n, ..., k^n, ...


is a basis (whose order of course depends on n). This conjecture
was also actually propounded by Waring as early as the eighteenth 
century. The problem proved to be very difficult, however, and it 
was not until the beginning of the present century that the universal 
validity of Waring’s hypothesis was demonstrated, by Hilbert (1909). 
Hilbert’s proof is not only ponderous in its formal aspect and based 
on complicated analytical theories (multiple integrals), but also 
lacks transparency in conceptual respects. The eminent French 
mathematician Poincaré wrote in his survey of Hilbert’s creative
scientific work, that once the basic motivations behind this proof
were understood, arithmetical results of great importance would
probably flow forth as from a cornucopia. In a certain sense he was
right. Ten to fifteen years later, new proofs of Hilbert’s theorem
were furnished by Hardy and Littlewood in England and by I. M.
Vinogradov in the USSR. These proofs were again analytic and formally
unwieldy, but differed favorably from Hilbert’s proof in their
clarity of method and their conceptual simplicity, which left nothing
to be desired. In fact, because of this, both methods became mighty
sources of new arithmetical theorems.

But when our science is concerned with such a completely ele- 
mentary problem as Waring’s problem, it invariably attempts to find 
a solution which requires no concepts or methods transcending the
limits of elementary arithmetic. The search for such an elementary
proof of Waring’s hypothesis is the third problem which I should
like to tell you about. Such a fully elementary proof of Hilbert’s 
theorem was first obtained in 1942, by the young Soviet scholar Y. 
V. Linnik. 

You are already accustomed to the fact that “‘elementary’’ does 
not mean “‘simple’’. The elementary solution of Waring’s problem 
discovered by Linnik is, as you will see, not very simple either, and 
it will take considerable effort on your part to understand and digest 
it. I shall endeavor to make this task as easy as possible for you
through my exposition. But you must remember that in mathematics 
(as probably in any other science) the assimilation of anything real- 
ly valuable and significant involves trying labor. 

The ideas of Schnirelmann which I described to you in the begin- 
ning of the second chapter play an essential role in Linnik’s proof. 
You will recall (I mentioned it at that time) how Schnirelmann
proved his famous theorem that the sequence P consisting of zero, 
unity, and all the primes, is a basis of the sequence of natural num- 
bers: He showed that the sequence P+P has a positive density. 
This immediately yields the assertion, however, because, according 
to the general theorem of Schnirelmann which we proved on pp. 24-25,
every sequence of positive density is a basis of the sequence of
natural numbers. The same method also lies at the basis of the proof
of Hilbert’s theorem discovered by Linnik. It all boils down to the
proof that the sum of a sufficiently large number of sequences (A_n)
is a sequence of positive density. As soon as this is accomplished,
we can, by virtue of the same general theorem of Schnirelmann, regard
Hilbert’s theorem as proved.


§2 
THE FUNDAMENTAL LEMMA 


If we add together k sequences, identical with A_n, according to
the rule in Chapter II, we evidently obtain a sequence A^(k) which
contains zero and all those natural numbers which can be expressed
as a sum of at most k summands of the form x^n, where x is an arbitrary
natural number. In other words, the number m belongs to the
sequence A^(k), if the equation


(1) x_1^n + x_2^n + ... + x_k^n = m


is solvable in nonnegative integers x_i (1 ≤ i ≤ k). As we saw in §1,
the problem is to show that, for sufficiently large k, the sequence
A^(k) has a positive density.

For preassigned k and m, equation (1) in general can be solved
in several different ways. In the sequel we shall denote by r_k(m) the
number of these ways, i.e., the number of systems of nonnegative
integers x_1, x_2, ..., x_k which satisfy equation (1). It is clear that the
number m occurs in A^(k) if and only if r_k(m) > 0.
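For small values the numbers r_k(m) can be tabulated directly. The following Python sketch is only an illustration; the function name `representation_counts` and the parameter `limit` are assumptions. It counts ordered systems (x_1, ..., x_k), so that, for instance, (1, 2) and (2, 1) are counted separately, in agreement with the definition above.

```python
def representation_counts(n, k, limit):
    """Return r with r[m] = r_k(m) for 0 <= m <= limit (ordered systems counted)."""
    powers = []
    x = 0
    while x ** n <= limit:            # all n-th powers not exceeding limit
        powers.append(x ** n)
        x += 1
    r = [1] + [0] * limit             # with no summands only m = 0 is represented
    for _ in range(k):                # add one summand x_i^n at a time
        nxt = [0] * (limit + 1)
        for m, ways in enumerate(r):
            if ways:
                for p in powers:
                    if m + p > limit:
                        break
                    nxt[m + p] += ways
        r = nxt
    return r
```

With n = 2 and k = 2 the table for m = 0, 1, ..., 5 reads 1, 2, 1, 0, 2, 2; the zero at m = 3 shows that 3 is not a sum of two squares.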

In the following, we shall assume the number n to be given and 
fixed, and shall therefore call all numbers which depend only upon
n, constants. Such constants will be denoted by the letter c or c(n),
where such a constant c may have different values in different parts 
of our discussion, provided merely that these values are constants. 
Perhaps you are rather unused to such “‘freedom’’ of notation, but 
you will soon become familiar with it. It has proved to be very con- 
venient, and appears more and more frequently in modern research. 

FUNDAMENTAL LEMMA. There exist a natural number k = k(n),
depending only on n, and a constant c, such that, for an arbitrary
natural number N,


(2) r_k(m) < c N^((k/n)-1)    (1 ≤ m ≤ N).


Once more, as in the preceding chapter, we are faced with two
problems: first, to prove the fundamental lemma, and second, to draw
from the fundamental lemma the conclusion that we need, viz., that
the sequence A^(k) has a positive density. This time again the second
problem is considerably easier than the first, and we shall therefore
begin with the second problem.

It follows immediately from the definition of the number r_k(m),
that the sum


r_k(0) + r_k(1) + ... + r_k(N) = R_k(N)


represents the number of systems (x_1, x_2, ..., x_k) of k nonnegative
integers for which
(3) x_1^n + x_2^n + ... + x_k^n ≤ N.


Every group of numbers for which
0 ≤ x_i ≤ (N/k)^(1/n) (1 ≤ i ≤ k),


obviously satisfies this condition. To satisfy these inequalities,
every x_i can evidently be chosen in more than (N/k)^(1/n) different
ways (x_i = 0, 1, ..., [(N/k)^(1/n)]).* After an arbitrary choice of this
sort, the numbers x_1, x_2, ..., x_k may be combined, and so we have
more than (N/k)^(k/n) different possibilities for choosing the complete
system of integers x_i (1 ≤ i ≤ k) so as to satisfy condition (3). This
shows that


(4) R_k(N) > (N/k)^(k/n).
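Inequality (4) can be observed numerically by counting the systems directly. The sketch below is a brute-force illustration, suitable only for small n, k and N; the function names `R` and `lower_bound` are assumptions, and the lower bound is computed in floating point.

```python
from itertools import product

def R(n, k, N):
    """R_k(N): the number of systems of k nonnegative integers with x_1^n+...+x_k^n <= N."""
    top = 0
    while (top + 1) ** n <= N:        # largest x with x^n <= N
        top += 1
    return sum(1 for xs in product(range(top + 1), repeat=k)
               if sum(x ** n for x in xs) <= N)

def lower_bound(n, k, N):
    """The right-hand side (N/k)^(k/n) of inequality (4)."""
    return (N / k) ** (k / n)
```

For n = 2, k = 2, N = 5 there are 8 systems, while (N/k)^(k/n) = 2.5.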


We assume that the fundamental lemma has been shown to be
correct, and that inequality (2) is satisfied for an arbitrary N. We
now have to verify that inequality (2) is consistent with inequality
(4) which we proved, only if the sequence A^(k) has a positive density.
The idea behind the following deduction is very simple: In the
sum R_k(N), only those summands r_k(m) are different from zero, for
which m occurs in A^(k). If A^(k) had density zero, then for large N
the number of such summands would be relatively small; because of
(2), however, no summand can be very large. Their sum R_k(N),
therefore, would also be relatively small, whereas according to (4)
it must be rather large.


* [a] denotes the largest integer ≤ a.

It remains to carry out the calculations. Suppose that d(A^(k)) = 0.
Then, for an arbitrary small ε > 0 and a suitably chosen N,


A^(k)(N) < εN.


Here the number N may be assumed to be arbitrarily large, because
A^(k) (for an arbitrary k) contains the integer 1 (bear in mind Problem
6 on p. 22, which you solved). Applying the estimate (2) we get


R_k(N) = Σ_{m=0}^{N} r_k(m) = r_k(0) + Σ_{m=1}^{N} r_k(m) < 1 + c N^((k/n)-1) A^(k)(N) < 1 + cε N^(k/n)


and hence, for sufficiently large N,


R_k(N) < 2cε N^(k/n).
For sufficiently small ε,
2cε < (1/k)^(k/n),
so that
R_k(N) < (N/k)^(k/n),


which contradicts (4). Therefore we must have
d(A^(k)) > 0.
But, as we already know, this proves Hilbert’s theorem.


You see how simply it all comes out. But we still have to prove 
the fundamental lemma, and to do this we shall have to travel a long 
and difficult road, as in the preceding chapter. 


§3 


LEMMAS CONCERNING LINEAR EQUATIONS 


We shall have to go far back. It will therefore be well for you to 
forget completely for a while the problem which has been posed. I
shall call your attention to it when we return to it later.

Right now, however, we have to find some estimates for the num- 
ber of solutions of systems of linear equations. The lemmas of this 
paragraph, moreover, are perhaps also of intrinsic interest, inde- 
pendent of the problem for whose solution they are required here. 


LEMMA 1. In the equation
(5) a_1 z_1 + a_2 z_2 = m,


let a_1, a_2, m be integers with |a_2| ≤ |a_1| ≤ A, and let a_1 and a_2 be
relatively prime. Then the number of solutions of equation (5) satisfying
the inequalities |z_1| ≤ A, |z_2| ≤ A, does not exceed 3A/|a_1|.
Proof: We may assume that a_1 > 0, because otherwise we have
merely to replace z_1 by -z_1 in every solution.
Let {z_1, z_2} and {z_1′, z_2′} be two different solutions of equation
(5). Then from


a_1 z_1 + a_2 z_2 = m,
a_1 z_1′ + a_2 z_2′ = m


we get
a_2 (z_2′ - z_2) = a_1 (z_1 - z_1′)


by subtraction. Accordingly the left-hand side of this equation must
be divisible by a_1. But* (a_1, a_2) = 1, and consequently z_2′ - z_2 must
be divisible by a_1. Now z_2′ ≠ z_2 (otherwise we should also have
z_1′ = z_1, and the two solutions would coincide), and therefore
|z_2′ - z_2|, as a multiple of a_1, is not smaller than a_1. Thus, for two
distinct solutions {z_1, z_2} and {z_1′, z_2′} of equation (5), we must have
|z_2′ - z_2| ≥ a_1.

In every solution {z_1, z_2} of equation (5), let us agree to call z_1
the first member and z_2 the second. It is obvious that the number of
solutions of equation (5) which satisfy the conditions |z_1| ≤ A,
|z_2| ≤ A, is not more than the number t of second members which occur
in the interval <-A, A>. Since we have proved that two such second
members are at least the distance a_1 apart, the difference between
the largest and smallest second members occurring in the interval
<-A, A> is at least a_1(t-1). On the other hand, this difference
does not exceed 2A, so that

a_1 (t-1) ≤ 2A,
t-1 ≤ 2A/a_1,
t ≤ (2A/a_1)+1 ≤ 3A/a_1


(because, by assumption, a_1 ≤ A, and therefore 1 ≤ A/a_1). This proves
Lemma 1.
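The bound of Lemma 1 can be confirmed by direct enumeration for small A. The Python sketch below is an illustration only; the function names `count_solutions` and `lemma1_holds` are assumptions, and the check simply runs through every admissible pair of coefficients and every attainable right-hand side m.

```python
from math import gcd

def count_solutions(a1, a2, m, A):
    """Number of integer solutions of a1*z1 + a2*z2 = m with |z1| <= A, |z2| <= A."""
    return sum(1 for z1 in range(-A, A + 1) for z2 in range(-A, A + 1)
               if a1 * z1 + a2 * z2 == m)

def lemma1_holds(A):
    """Check the bound 3A/|a1| for all relatively prime a1, a2 with |a2| <= |a1| <= A."""
    for a1 in range(-A, A + 1):
        for a2 in range(-abs(a1), abs(a1) + 1):
            if a1 == 0 or gcd(a1, a2) != 1:
                continue
            # |a1*z1 + a2*z2| cannot exceed 2*A*A, so no other m has solutions
            for m in range(-2 * A * A, 2 * A * A + 1):
                if count_solutions(a1, a2, m, A) * abs(a1) > 3 * A:
                    return False
    return True
```

For example, 3 z_1 + 2 z_2 = 1 has exactly four solutions with |z_1| ≤ 5, |z_2| ≤ 5, which is within the bound 3·5/3 = 5.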

* (a_1, a_2) denotes the greatest common divisor of the integers a_1 and a_2.

LEMMA 2. In the equation
(6) a_1 z_1 + a_2 z_2 + ... + a_l z_l = m,


let the a_i and m be integers satisfying the conditions*
|a_i| ≤ A (1 ≤ i ≤ l), (a_1, a_2, ..., a_l) = 1.


Then the number of solutions of equation (6) satisfying the inequalities
|z_i| ≤ A (1 ≤ i ≤ l), does not exceed


c(l) A^(l-1)/H,


where H is the largest of the numbers |a_1|, |a_2|, ..., |a_l|, and c(l) is
a constant depending only on l.


Proof: If l = 2, Lemma 2 obviously becomes Lemma 1 (with c(2) =
3). Accordingly Lemma 2 is already verified for l = 2. We shall therefore
assume that l ≥ 3 and that the truth of Lemma 2 has already been
established for the case of l-1 unknowns. Since the numbering is
unimportant, we may assume that |a_l| is the largest of the numbers
|a_1|, |a_2|, ..., |a_l|, i.e., H = |a_l|.

There are two cases to consider.

1) a_1 = a_2 = ... = a_{l-1} = 0. Since (a_1, a_2, ..., a_l) = 1, we have |a_l| = H =
1, so that the given equation is of the form ±z_l = m. In this equation
each of the unknowns z_1, z_2, ..., z_{l-1} can obviously assume an arbitrary
integral value in the interval <-A, A>, and hence at most
2A+1 ≤ 3A values all told. As for z_l, however, it can assume at most
one value. Consequently the number of solutions of the given equation
satisfying the inequalities |z_i| ≤ A (1 ≤ i ≤ l), does not exceed


(3A)^(l-1) = c(l) A^(l-1) = c(l) A^(l-1)/H,


which proves Lemma 2 for this case.
* (a_1, a_2, ..., a_l) denotes the greatest common divisor of the integers in parentheses.

2) If at least one of the numbers a_1, a_2, ..., a_{l-1} is different from
zero, then


(a_1, a_2, ..., a_{l-1}) = δ
exists. Let us denote by H′ the largest of the numbers
|a_i|/δ (1 ≤ i ≤ l-1).


Suppose now that the numbers z_1, z_2, ..., z_l satisfy the given equation
(6) and the inequalities |z_i| ≤ A (1 ≤ i ≤ l). We set


(7) (a_1/δ) z_1 + (a_2/δ) z_2 + ... + (a_{l-1}/δ) z_{l-1} = m′


and hence
a_1 z_1 + a_2 z_2 + ... + a_{l-1} z_{l-1} = δm′.
Then obviously
(8) δm′ + a_l z_l = m
and


|δm′| ≤ Σ_{i=1}^{l-1} |a_i| |z_i| ≤ l δ H′ A,


which implies that


|m′| ≤ l H′ A.


Thus, if the numbers z_1, z_2, ..., z_l satisfy equation (6) and the inequalities
|z_i| ≤ A (1 ≤ i ≤ l), then the integer m′ exists, which, with
these numbers, satisfies equations (7) and (8), where |m′| ≤ l H′ A.
But in equation (8) clearly δ ≤ |a_l| and (δ, a_l) = 1 (otherwise we should
have (a_1, a_2, ..., a_{l-1}, a_l) > 1). Hence, by Lemma 1, the number of solutions
of equation (8) (in the unknowns m′, z_l), for which |m′| ≤
l H′ A, |z_l| ≤ A ≤ l H′ A, does not exceed 3 l H′ A/|a_l|. For the same m′,
equation (7), according to Lemma 2 for equations in l-1 unknowns,
has at most c(l) A^(l-2)/H′ solutions in integral z_i with |z_i| ≤ A.

It is evident, from what has been said, that the number of solutions
{z_1, z_2, ..., z_l} of equation (6) which satisfy the inequalities
|z_i| ≤ A (1 ≤ i ≤ l), does not exceed


(3 l H′ A/|a_l|) (c(l) A^(l-2)/H′) = c(l) A^(l-1)/|a_l|
= c(l) A^(l-1)/H,


which completes the proof of Lemma 2.*
We shall now investigate the totality of equations of the form


(9) a_1 z_1 + a_2 z_2 + ... + a_l z_l = 0,


where |a_i| ≤ A (1 ≤ i ≤ l) and, as always, all a_i are integers. Let B be a
positive number whose relation to the number A is described by the inequalities
1 ≤ A ≤ B ≤ c(l) A^(l-1), and let l > 2. We now want to estimate
the sum of the numbers of solutions z_i, |z_i| ≤ B (1 ≤ i ≤ l) of all the
equations (9) of this family.


*You have probably noticed that in the last chain of equations the symbol c(l) occurred
in different places with different meanings. On p. 39 I prepared you for such a use
of this symbol.

1° First let us make a separate examination of equation (9) for
$a_1 = a_2 = \ldots = a_l = 0$ (it is a member of our family) and estimate the
number of its solutions which satisfy the inequalities $|z_i| \le B$
$(1 \le i \le l)$. Our equation is obviously satisfied by an arbitrary system of
numbers $z_1, z_2, \ldots, z_l$, and we have merely to calculate how many
such systems exist which satisfy the inequalities $|z_1| \le B$, $|z_2| \le B$,
$\ldots$, $|z_l| \le B$. Since the interval $\langle -B, +B\rangle$ contains at most $2B+1$
integers, each $z_i$ can assume at most $2B+1$ different values.
Consequently the number of systems $\{z_1, z_2, \ldots, z_l\}$ of the type in which
we are interested does not exceed $(2B+1)^l \le (3B)^l = c(l)B^l$. By our
hypothesis, however, $B \le c(l)A^{l-1}$, so that $c(l)B^l = c(l)B^{l-1}B \le c(l)(AB)^{l-1}$.
Hence, for the case where $a_1 = a_2 = \ldots = a_l = 0$, equation (9) has at
most $c(l)(AB)^{l-1}$ solutions of the type we are interested in.

2° Even if only one of the coefficients $a_i$ is different from zero,
the greatest common divisor of these coefficients, $(a_1, a_2, \ldots, a_l) = \delta$,
exists. Suppose first that $\delta = 1$, and let $H$ be the largest of the
numbers $|a_i|$ $(i = 1, 2, \ldots, l)$. Clearly $H$ is one of the integers in the
interval $\langle 1, A\rangle$. Hence, $H$ is either between $A$ and $A/2$, or between $A/2$
and $A/4$, or between $A/4$ and $A/8$, etc. It is therefore possible to
find an integer $m \ge 0$ such that


$$A/2^{m+1} < H \le A/2^m. \tag{10}$$


According to Lemma 2, for an equation of the form (9) in which
$\delta = 1$ and $H$ satisfies the inequalities (10), the number of solutions
$z_i$, $|z_i| \le B$, does not exceed


$$c(l)B^{l-1}/H < c(l)B^{l-1}/(A/2^{m+1}) = c(l)B^{l-1}2^m/A.$$

On the other hand, it follows from (10) that

$$|a_i| \le A/2^m \quad (1 \le i \le l). \tag{11}$$


Consequently the number of equations of type (9) for which the
inequalities (10) are satisfied is at most equal to the number of
equations of the same type which satisfy the conditions (11), i.e., at most


$$(2(A/2^m) + 1)^l \le (3A/2^m)^l = c(l)A^l 2^{-ml}.$$


Thus the sum of the numbers of solutions $|z_i| \le B$ of all such
equations of type (9) for which $\delta = 1$ and $A2^{-(m+1)} < H \le A2^{-m}$ does not
exceed

$$(c(l)B^{l-1}2^m/A)\cdot c(l)A^l 2^{-ml} = c(l)(AB)^{l-1}2^{-(l-1)m}.$$


Summing this estimate over all $m \ge 0$, we reach the following
conclusion: The sum of the numbers of solutions $|z_i| \le B$ of all
equations (9) for which $|a_i| \le A$ $(1 \le i \le l)$ and $\delta = 1$ is at most


$$c(l)(AB)^{l-1}.$$
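
The summation over $m$ in the last step can be written out as a geometric series. The following worked step is only a sketch added for convenience; it uses nothing beyond the assumption $l > 2$ made above, so that the ratio $2^{-(l-1)}$ is at most $1/4$:

```latex
\[
\sum_{m \ge 0} c(l)(AB)^{l-1}\,2^{-(l-1)m}
  = c(l)(AB)^{l-1}\cdot\frac{1}{1 - 2^{-(l-1)}}
  \le \frac{4}{3}\,c(l)(AB)^{l-1}
  = c(l)(AB)^{l-1}.
\]
```

Only finitely many values of $m$ actually occur (those with $A/2^m \ge 1$), so the infinite series is merely a convenient upper bound, and the factor $4/3$ is absorbed into $c(l)$ in the usual way.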


3° It remains for us to figure out the numbers of solutions of the
required type for equations with $\delta > 1$. In this case equation (9) is
evidently equivalent to the equation


$$(a_1/\delta)z_1 + (a_2/\delta)z_2 + \ldots + (a_l/\delta)z_l = 0,$$

where now

$$(a_1/\delta, a_2/\delta, \ldots, a_l/\delta) = 1$$


and the number $A$ has to be replaced by the number $A/\delta$. As we saw
in 2°, the sum of the numbers of solutions $|z_i| \le B$ of all such
equations, for a given, fixed $\delta$, does not exceed*


$$c(l)(A\delta^{-1}\cdot B)^{l-1} = c(l)(AB)^{l-1}\delta^{-(l-1)}.$$


Clearly now we have merely to sum this expression over all the
possible values of $\delta$ $(1 \le \delta \le A)$.

*Since instead of $A$ we now have to take the smaller number $A/\delta$, it is
conceivable that the assumed condition $B \le c(l)A^{l-1}$ is violated. You can verify, however,
without any trouble, that we made no use of this assumption in Case 2°, and that the
result in 2° therefore does not depend on it.

Thus we find that the sum of the numbers of required solutions
of all equations of the form (9), where $|a_i| \le A$ $(1 \le i \le l)$ and not all
$a_i$ are equal to zero, does not exceed the value


$$c(l)(AB)^{l-1}\sum_{\delta=1}^{A}\delta^{-(l-1)} < c(l)(AB)^{l-1}\cdot\frac{l-1}{l-2} = c(l)(AB)^{l-1}.$$


[To obtain the first relation we employ the inequality


$$\sum_{n=1}^{A}\frac{1}{n^{q+1}} < \frac{q+1}{q},$$


which is valid for an arbitrary natural number $q$ and for an arbitrary
$A \ge 1$ (we denote by $q$ the number $l-2$, which is positive because we
assumed that $l > 2$). Here is a simple proof: For $n \ge 1$ we have


$$\begin{aligned}
n^{-q} - (n+1)^{-q} &= \frac{(n+1)^q - n^q}{n^q(n+1)^q}
 = \frac{n^q + qn^{q-1} + \ldots + 1 - n^q}{n^q(n+1)^q} \\
 &\ge \frac{qn^{q-1}}{n^q(n+1)^q} > \frac{q}{(n+1)^{q+1}},
\end{aligned}$$


and hence


$$(n+1)^{-(q+1)} < q^{-1}\{n^{-q} - (n+1)^{-q}\}.$$


By substituting successively $n = 1, 2, \ldots, A-1$ in this inequality and
adding all the resulting inequalities together we find that


$$\sum_{n=2}^{A} n^{-(q+1)} < q^{-1}(1 - A^{-q}) < 1/q,$$

which implies that


$$\sum_{n=1}^{A} n^{-(q+1)} < 1 + (1/q) = (q+1)/q, \quad \text{Q.E.D.]}$$
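
The bracketed inequality is easy to check numerically. The sketch below is an illustration only; the function names `partial_sum` and `bound` are introduced here and do not come from the text. It evaluates both sides exactly with rational arithmetic.

```python
from fractions import Fraction


def partial_sum(A: int, q: int) -> Fraction:
    """Exact value of the sum of 1/n^(q+1) for n = 1, ..., A."""
    return sum((Fraction(1, n ** (q + 1)) for n in range(1, A + 1)), Fraction(0))


def bound(q: int) -> Fraction:
    """Right-hand side (q+1)/q of the inequality."""
    return Fraction(q + 1, q)


if __name__ == "__main__":
    for q in (1, 2, 3):
        for A in (1, 10, 200):
            s = partial_sum(A, q)
            # Print both sides and whether the strict inequality holds.
            print(q, A, float(s), float(bound(q)), s < bound(q))
```

Exact fractions are used so that the comparison is not blurred by floating-point rounding when the two sides are close.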

Comparing this with the result in 1°, where we obtained an
estimate for the case $a_1 = a_2 = \ldots = a_l = 0$, we reach the following
conclusion:

LEMMA 3. Let $l > 2$ and $1 \le A \le B \le c(l)A^{l-1}$. Then the sum of the
numbers of solutions $|z_i| \le B$ $(1 \le i \le l)$ of all equations of the form


$$a_1 z_1 + a_2 z_2 + \ldots + a_l z_l = 0, \tag{9}$$

where $|a_i| \le A$ $(1 \le i \le l)$, does not exceed

$$c(l)(AB)^{l-1}.$$
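
For very small parameters the quantity estimated in Lemma 3 can be computed by brute force. The following is a rough sketch with names of my own choosing (`count_solutions`, `total_over_family`, `ratio`). Since the lemma does not specify the constant $c(l)$, the code only reports the ratio of the total to $(AB)^{l-1}$, and the ranges are kept tiny because the search is exponential in $l$.

```python
from itertools import product


def count_solutions(coeffs, B):
    """Number of integer vectors z with |z_i| <= B and sum(a_i * z_i) == 0."""
    values = range(-B, B + 1)
    return sum(
        1
        for z in product(values, repeat=len(coeffs))
        if sum(a * x for a, x in zip(coeffs, z)) == 0
    )


def total_over_family(l, A, B):
    """Sum of the solution counts over all coefficient vectors with |a_i| <= A."""
    values = range(-A, A + 1)
    return sum(count_solutions(a, B) for a in product(values, repeat=l))


def ratio(l, A, B):
    """Total divided by (AB)^(l-1); Lemma 3 says this stays bounded by some c(l)."""
    return total_over_family(l, A, B) / (A * B) ** (l - 1)


if __name__ == "__main__":
    for A in (1, 2, 3):
        print(A, A + 1, ratio(3, A, A + 1))
```

The all-zero equation of case 1° is included in the family, exactly as in the statement of the lemma.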


§4 


TWO MORE LEMMAS 


Before proceeding to prove the fundamental lemma, we have to 
derive two more lemmas of a special type. They are both very simple, 
in idea as well as in form, and yet their assimilation might cause 
you some difficulty because they are concerned with the enumeration 
of all possible combinations, whose construction is rather involved. 
The difficulty with such an abstract combinatorial problem is that it 
is hard to put it in mathematical symbols: one has to express more in 
words than in signs. This is of course a difficulty of presentation,
however, and not of the subject itself, and I shall take pains to out- 
line all questions that arise, and their solution, as concretely as 
possible. 

We shall denote by $A$ a finite complex (i.e., collection) of
numbers, not all of which are necessarily distinct. If the number $a$
occurs $\lambda$ times in the complex $A$, we shall say that its multiplicity is
$\lambda$. Let $a_1, a_2, \ldots, a_r$ be the distinct numbers which appear in $A$, and
let $\lambda_1, \lambda_2, \ldots, \lambda_r$ be their respective multiplicities (so that the
complex $A$ contains altogether $\sum_{i=1}^{r}\lambda_i$ numbers). Let $B$ be another
complex of the same type, which consists of the distinct numbers $b_1$,
$b_2, \ldots, b_s$ with the respective multiplicities $\mu_1, \mu_2, \ldots, \mu_s$.
Let us investigate the equation 


(12) x+y=c, 


where $c$ is a given number and $x$ and $y$ are unknowns. We are
interested in such solutions $\{x, y\}$ of this equation in which $x$ is one of
the numbers of the complex $A$ (abbreviated $x \in A$) and $y$ is one of the
numbers of the complex $B$ ($y \in B$). If the numbers $x = a_i$ and $y = b_k$
satisfy equation (12), this yields $\lambda_i\mu_k$ solutions of the required kind,
because any one of the $\lambda_i$ "specimens" of the number $a_i$ which
occur in the complex $A$ can be combined with an arbitrary one of the
$\mu_k$ specimens of the number $b_k$ appearing in the complex $B$. But we
have* $\lambda_i\mu_k \le \frac{1}{2}(\lambda_i^2 + \mu_k^2)$. Therefore the number of such solutions of
equation (12), where $x = a_i$, $y = b_k$, is not greater than $\frac{1}{2}(\lambda_i^2 + \mu_k^2)$. It
follows that the number of all solutions $x \in A$, $y \in B$ of equation (12)
is not more than the sum $\sum\frac{1}{2}(\lambda_i^2 + \mu_k^2)$. Here the summation is over all
pairs of indices $\{i, k\}$ for which $a_i + b_k = c$. Our sum is enlarged if we
sum $\lambda_i^2$ over all $i$ and $\mu_k^2$ over all $k$ (because every $b_k$ can be
combined with at most one $a_i$). It finally follows, therefore, that the
number of solutions $x \in A$, $y \in B$ of equation (12) does not exceed the
number


$$\frac{1}{2}\left(\sum_{i=1}^{r}\lambda_i^2 + \sum_{k=1}^{s}\mu_k^2\right).$$


*"The geometric mean is not greater than the arithmetic mean." Here is the
simplest proof:
$0 \le (\lambda_i - \mu_k)^2 = \lambda_i^2 + \mu_k^2 - 2\lambda_i\mu_k$, and hence $2\lambda_i\mu_k \le \lambda_i^2 + \mu_k^2$.
On the other hand, let us consider the equation

$$x - y = 0 \tag{13}$$


and calculate the number of its solutions $x \in A$, $y \in A$. Clearly every
such solution is of the form $x = y = a_i$ $(1 \le i \le r)$. For a given $i$ we
obtain $\lambda_i^2$ solutions, because the numbers $x$ and $y$ can coincide,
independently of one another, with any one of the $\lambda_i$ specimens of the
number $a_i$ appearing in $A$. Accordingly the total number of solutions
$x \in A$, $y \in A$ of equation (13) is equal to $\sum_{i=1}^{r}\lambda_i^2$. In exactly the same
way we find, of course, that the number of solutions $x \in B$, $y \in B$ of
the same equation is equal to $\sum_{k=1}^{s}\mu_k^2$. If we compare these results with
the one found above, we reach the following conclusion:

LEMMA 4. The number of solutions of the equation


$$x + y = c, \quad x \in A,\ y \in B$$


does not exceed half the sum of the numbers of solutions of the
equations


$$x - y = 0, \quad x \in A,\ y \in A$$

and

$$x - y = 0, \quad x \in B,\ y \in B.$$


For the special case in which the complexes $A$ and $B$ coincide
we obtain the following


COROLLARY. The number of solutions of the equation

$$x + y = c, \quad x \in A,\ y \in A$$

does not exceed the number of solutions of the equation


$$x - y = 0, \quad x \in A,\ y \in A.$$
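
Lemma 4 can be tried out on small complexes. In the sketch below a complex is modelled, as an assumption of this illustration, by a Python list in which repeated entries stand for multiplicities; the helper names are hypothetical.

```python
from collections import Counter


def count_sum(A, B, c):
    """Solutions of x + y = c with x in A, y in B, counted with multiplicity."""
    mult_b = Counter(B)
    # Each specimen x of A pairs with every specimen of c - x in B.
    return sum(mult_b[c - x] for x in A)


def count_diff_zero(A):
    """Solutions of x - y = 0 with x, y in A: the sum of squared multiplicities."""
    return sum(m * m for m in Counter(A).values())


def lemma4_bound(A, B):
    """Half the sum of the two counts for x - y = 0, as in Lemma 4."""
    return (count_diff_zero(A) + count_diff_zero(B)) / 2


if __name__ == "__main__":
    A = [1, 1, 2, 5]
    B = [0, 3, 3, 4]
    for c in range(0, 10):
        print(c, count_sum(A, B, c), lemma4_bound(A, B))
```

The bound on the right does not depend on $c$, which is what makes the lemma useful later on: a count for an arbitrary right-hand side is traded for a count for the right-hand side zero.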


Now let $k$ and $s$ be two arbitrary natural numbers. We put $k\cdot 2^s = l$,
and investigate the equation


$$x_1 + x_2 + \ldots + x_l = c.$$


Let $A_1, A_2, \ldots, A_l$ be finite complexes of numbers. Suppose that
the complex $A_i$ $(1 \le i \le l)$ consists of the distinct numbers $a_{i1}, a_{i2}, \ldots$
with the respective multiplicities $\lambda_{i1}, \lambda_{i2}, \ldots$. We are interested in
the number of solutions of the equation


$$x_1 + x_2 + \ldots + x_l = c, \quad x_i \in A_i \quad (1 \le i \le l). \tag{14}$$

If we set

$$x_1 + x_2 + \ldots + x_{l/2} = x, \qquad x_{(l/2)+1} + x_{(l/2)+2} + \ldots + x_l = y$$


($l/2$ is of course an integer), then the given equation can be written
in the form


$$x + y = c,$$


and Lemma 4, which we have just proved, can be applied to it. We
have only to find out to which complexes the numbers $x$ and $y$
belong. Since $x_i \in A_i$ $(1 \le i \le l)$, $x$ can be an arbitrary number of the form
$z_1 + z_2 + \ldots + z_{l/2}$, where $z_i \in A_i$ $(1 \le i \le l/2)$. Similarly $y$ can be an
arbitrary number of the same form, where, however, $z_i \in A_{(l/2)+i}$
$(1 \le i \le l/2)$.

Hence, by Lemma 4, the number of solutions of equation (14)
does not exceed half the sum of the numbers of solutions of the
equation

$$x - y = 0 \tag{15}$$

under the following two hypotheses:


1) $x = z_1 + z_2 + \ldots + z_{l/2}$,
   $y = z_1' + z_2' + \ldots + z_{l/2}'$,

where

$$z_i \in A_i, \quad z_i' \in A_i \quad (1 \le i \le l/2), \tag{16}$$

2) $x$ and $y$ have the same form, but

$$z_i \in A_{(l/2)+i}, \quad z_i' \in A_{(l/2)+i} \quad (1 \le i \le l/2). \tag{17}$$


In both cases equation (15) may be rewritten in the form

$$(z_1 - z_1') + (z_2 - z_2') + \ldots + (z_{l/2} - z_{l/2}') = 0. \tag{18}$$


We conclude therefore that the number of solutions of equation (14)
does not exceed half the sum of the numbers of solutions of equation
(18) under the hypotheses (16) and (17), i.e., it does not exceed
half the sum of the numbers of solutions of the equations


$$\sum_{i=1}^{l/2}(z_i - z_i') = 0, \quad z_i \in A_i,\ z_i' \in A_i \quad (1 \le i \le l/2) \tag{18a}$$

and

$$\sum_{i=1}^{l/2}(z_i - z_i') = 0, \quad z_i \in A_{(l/2)+i},\ z_i' \in A_{(l/2)+i} \quad (1 \le i \le l/2). \tag{18b}$$


Equation (18) has $l/2$ summands on the left-hand side, i.e., half
as many as the original equation (14).


We set

$$\sum_{i=1}^{l/4}(z_i - z_i') = x, \qquad \sum_{i=(l/4)+1}^{l/2}(z_i - z_i') = y,$$


and thereby bring equation (18) into the form

$$x + y = 0.$$


To this we can apply Lemma 4 anew. It is evident that, just as we
arrived at equation (18) from equation (14), we now get from equation
(18) to the equation


$$\sum_{i=1}^{l/4}(u_i + u_i' - u_i'' - u_i''') = 0, \tag{19}$$


where we have to consider the sum of the numbers of solutions of
this equation under the following (now four) hypotheses:

1) $u_i, u_i', u_i'', u_i''' \in A_i$,

2) $u_i, u_i', u_i'', u_i''' \in A_{(l/4)+i}$,

3) $u_i, u_i', u_i'', u_i''' \in A_{(l/2)+i}$,

4) $u_i, u_i', u_i'', u_i''' \in A_{(3l/4)+i}$ $(1 \le i \le l/4)$.


Since $l = k\cdot 2^s$, we can repeat this process $s$ times. We evidently end
up then with the equation

$$\sum_{i=1}^{k}\{y_i^{(1)} + y_i^{(2)} + \ldots + y_i^{(2^{s-1})} - y_i^{(2^{s-1}+1)} - \ldots - y_i^{(2^s)}\} = 0, \tag{20}$$

where we have to consider the sum of the numbers of solutions of
this equation under $2^s$ different hypotheses, viz.:

1) $y_1^{(j)} \in A_1,\ y_2^{(j)} \in A_2,\ \ldots,\ y_k^{(j)} \in A_k$,

2) $y_1^{(j)} \in A_{k+1},\ y_2^{(j)} \in A_{k+2},\ \ldots,\ y_k^{(j)} \in A_{2k}$,

$\ldots$

$2^s$) $y_1^{(j)} \in A_{(2^s-1)k+1},\ y_2^{(j)} \in A_{(2^s-1)k+2},\ \ldots,\ y_k^{(j)} \in A_{2^s k}$ $(1 \le j \le 2^s)$.

If we put

$$y^{(j)} = y_1^{(j)} + y_2^{(j)} + \ldots + y_k^{(j)} \quad (1 \le j \le 2^s),$$

then equation (20) takes on the simple form

$$y^{(1)} + y^{(2)} + \ldots + y^{(2^{s-1})} - y^{(2^{s-1}+1)} - \ldots - y^{(2^s)} = 0. \tag{21}$$


Here we are concerned with the sum of the numbers of solutions of
equation (21) under the following $2^s$ hypotheses, which differ from
one another in the value of the parameter $w$ $(0 \le w \le 2^s - 1)$:


$$y^{(j)} = y_1^{(j)} + y_2^{(j)} + \ldots + y_k^{(j)},$$

where

$$y_1^{(j)} \in A_{wk+1},\ y_2^{(j)} \in A_{wk+2},\ \ldots,\ y_k^{(j)} \in A_{wk+k} \quad (j = 1, 2, \ldots, 2^s).$$

Thus we can express the final result of our deduction in the form 
of the following proposition: 

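LEMMA 5. If $l = k\cdot 2^s$, the number of solutions of the equation

$$x_1 + x_2 + \ldots + x_l = c, \quad x_i \in A_i \quad (1 \le i \le l) \tag{14}$$

does not exceed the sum of the numbers of solutions of the equation


$$y^{(1)} + y^{(2)} + \ldots + y^{(2^{s-1})} - y^{(2^{s-1}+1)} - \ldots - y^{(2^s)} = 0, \tag{21}$$


$$y^{(j)} = \sum_{i=1}^{k} y_i^{(j)}, \quad y_i^{(j)} \in A_{wk+i} \quad (1 \le i \le k,\ 1 \le j \le 2^s),$$


under the hypotheses $w = 0, 1, \ldots, 2^s - 1$.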


Notice the connection between Lemma 4 and Lemma 5 for
$k = s = 1$, $l = 2$.
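
Lemma 5 can also be tested on small examples. The sketch below rests on assumptions of this illustration: complexes are Python lists with repeated entries for multiplicities, `complexes` must contain exactly $k\cdot 2^s$ of them, and the function names are hypothetical. It counts the solutions of (14) and the sum over $w$ of the solution counts of (21) by building multiplicity tables of sums.

```python
from collections import Counter


def convolve(p, q):
    """Multiplicity table of all sums a + b with a from p and b from q."""
    out = Counter()
    for a, m in p.items():
        for b, n in q.items():
            out[a + b] += m * n
    return out


def sum_table(complexes):
    """Multiplicity table of x_1 + ... + x_r with x_i taken from complexes[i]."""
    table = Counter({0: 1})
    for comp in complexes:
        table = convolve(table, Counter(comp))
    return table


def lhs_count(complexes, c):
    """Number of solutions of equation (14)."""
    return sum_table(complexes)[c]


def rhs_count(complexes, k, s):
    """Sum over w = 0, ..., 2^s - 1 of the numbers of solutions of equation (21)."""
    half = 2 ** (s - 1)
    total = 0
    for w in range(2 ** s):
        y = sum_table(complexes[w * k:(w + 1) * k])  # table of a single y^(j)
        plus = Counter({0: 1})
        for _ in range(half):
            plus = convolve(plus, y)  # table of y^(1) + ... + y^(2^(s-1))
        # The subtracted half has the same table, so (21) has
        # sum-of-squares many solutions for this w.
        total += sum(m * m for m in plus.values())
    return total


if __name__ == "__main__":
    comps = [[0, 1, 1], [2, 3], [1, 1, 4], [0, 5]]
    for c in range(3, 14):
        print(c, lhs_count(comps, c), rhs_count(comps, 1, 2), rhs_count(comps, 2, 1))
```

For $k = s = 1$ the right-hand count is exactly twice the bound of Lemma 4, which is the connection mentioned above: Lemma 5 states the weaker inequality without the factor one half.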

This winds up our preliminaries, and we are ready now to begin 
the direct assault on the fundamental lemma. 
§5


PROOF OF THE FUNDAMENTAL LEMMA 


We are going to prove the fundamental lemma by the method of 
induction on n. It is often the case in inductive proofs, that a 
strengthening of the proposition to be proved, considerably facili- 
tates its proof by the given method (and sometimes is actually what 
makes the proof feasible in the first place). The reason for this is 
easy to understand. In inductive proofs, the proposition is assumed 
to be correct for the number $n-1$, and is proved for the number $n$.
Hence, the stronger the proposition, the more that is given to us by
the case $n-1$; of course, so much the more has to be proved for the
number n, but in many problems the first consideration turns out to 
be more important than the second. 

And so it is, in fact, in the present case. Of immediate interest
for us is the number of solutions of the equation $x_1^n + x_2^n + \ldots + x_k^n = m$
$(1 \le m \le N)$ (where, according to the very meaning of the problem,
$0 \le x_i \le m^{1/n} \le N^{1/n}$). But $x^n$ is the simplest special case of an $n$-th
degree polynomial


$$f(x) = a_0 x^n + a_1 x^{n-1} + \ldots + a_{n-1}x + a_n,$$


and it will be to our advantage to replace the given equation (1) by
the more general equation


$$f(x_1) + f(x_2) + \ldots + f(x_k) = m, \tag{22}$$


where the unknowns are subjected to the weaker conditions $|x_i| \le
N^{1/n}$ $(1 \le i \le k)$. The proof of our proposition for equation (22) will
give us more than we really need; but, as you will see, it is just
this strengthening of our proposition which creates the possibility
of an induction. And so, for $m \le N$, let us denote by $r_k(m)$ the number
of solutions of equation (22) which satisfy the conditions $|x_i| \le N^{1/n}$
$(1 \le i \le k)$. Of course we are still free to dispose arbitrarily of the
coefficients of the polynomial $f(x)$ in the interest of the induction to be
performed (provided only that the imposed conditions are satisfied
in the case $f(x) = x^n$). We are going to prove the following proposition:


Let the coefficients of the polynomial $f(x)$ satisfy the inequalities

$$|a_i| \le c(n)N^{i/n} \quad (0 \le i \le n). \tag{23}$$

Then, for a suitably chosen $k = k(n)$,

$$r_k(m) < c(n)N^{(k/n)-1} \quad (1 \le m \le N).$$


Since the inequalities (23) are obviously satisfied in the case
$f(x) = x^n$ for $c(n) = 1$, this theorem is indeed a sharpening of our
fundamental lemma.


Let us first consider the case $n = 1$, $f(x) = a_0 x + a_1$. We set $k(1) = 2$,
so that equation (22) acquires the form


$$a_0(x_1 + x_2) = m - 2a_1.$$


We are interested in solutions of this equation which satisfy the
requirements $|x_1| \le N$, $|x_2| \le N$. Thus at most $2N + 1 \le 3N$ values are
possible for $x_1$. But at most one $x_2$ corresponds to every $x_1$, so that


$$r_2(m) \le 3N,$$


which completes the proof of our proposition for $n = 1$ $(k = 2)$.
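
The count for $n = 1$ can be reproduced directly. This is a small sketch under one assumption that the text leaves implicit, namely $a_0 \ne 0$ (so that $f$ really has degree 1 and $x_2$ is determined by $x_1$); the function name `r2` is introduced here.

```python
def r2(m, a0, a1, N):
    """Pairs (x1, x2) with |x1|, |x2| <= N and f(x1) + f(x2) = m, f(x) = a0*x + a1.

    Assumes a0 != 0, so that each x1 admits at most one x2.
    """
    count = 0
    for x1 in range(-N, N + 1):
        rest = m - 2 * a1 - a0 * x1  # this must equal a0 * x2
        if rest % a0 == 0 and abs(rest // a0) <= N:
            count += 1
    return count


if __name__ == "__main__":
    N = 10
    for m in range(1, N + 1):
        print(m, r2(m, 3, 5, N), 3 * N)
```

The loop mirrors the argument in the text: $x_1$ ranges over at most $2N + 1$ values and each of them contributes at most one solution.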
Now let $n > 1$, and suppose that our assertion has already been
verified for the exponent $n - 1$. Put $k(n-1) = k'$ and choose


$$k = k(n) = 2n\cdot 2^{[4\log_2 k']},$$


where the exponent means the greatest integer not exceeding $4\log_2 k'$.
In the sequel we shall set $[4\log_2 k'] - 1 = s$, for brevity, so that


$$k = 2n\cdot 2^{s+1}. \tag{24}$$
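
To get a feeling for how quickly $k(n)$ grows, the recursion can be evaluated exactly. The sketch below uses helper names of my own (`floor_4log2`, `k_of`, `s_of`) and computes $[4\log_2 k']$ as the integer part of $\log_2(k'^4)$ with integer arithmetic, which avoids floating-point rounding.

```python
def floor_4log2(kp: int) -> int:
    """[4*log2(kp)], computed exactly as floor(log2(kp**4))."""
    return (kp ** 4).bit_length() - 1


def k_of(n: int) -> int:
    """k(n) from the recursion k(1) = 2, k(n) = 2n * 2^[4 log2 k(n-1)]."""
    k = 2
    for d in range(2, n + 1):
        k = 2 * d * 2 ** floor_4log2(k)
    return k


def s_of(n: int) -> int:
    """s = [4 log2 k(n-1)] - 1, defined for n >= 2."""
    return floor_4log2(k_of(n - 1)) - 1


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, k_of(n))
```

The values grow extremely fast, which is harmless here: the fundamental lemma only needs some finite $k(n)$, not a small one.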


To estimate the number, $r_k(m)$, of solutions of equation (22), we
first apply Lemma 4 to it, setting


$$x = \sum_{i=1}^{k/2} f(x_i), \qquad y = \sum_{i=(k/2)+1}^{k} f(x_i).$$


The complex $A$ (and the complex $B$ which coincides with it in
this case) consists of all sums of the form


$$\sum_{i=1}^{k/2} f(x_i), \quad \text{where } |x_i| \le N^{1/n} \quad (1 \le i \le k/2).$$


By the Corollary of Lemma 4, $r_k(m)$ does not exceed the number of
solutions of the equation $x - y = 0$, where $x \in A$, $y \in A$, i.e.,


$$x = \sum_{i=1}^{k/2} f(x_i), \qquad y = \sum_{i=1}^{k/2} f(y_i),$$


$$|x_i| \le N^{1/n}, \quad |y_i| \le N^{1/n} \quad (1 \le i \le k/2).$$


In other words, $r_k(m)$ does not exceed the number of solutions of the
equation


$$\sum_{i=1}^{k/2}\{f(x_i) - f(y_i)\} = 0, \tag{25}$$


where $|x_i| \le N^{1/n}$, $|y_i| \le N^{1/n}$ $(1 \le i \le k/2)$. We now set $x_i - y_i = h_i$
$(1 \le i \le k/2)$ and replace the system of unknowns $\{x_i, y_i\}$ by the
system $\{y_i, h_i\}$; here we allow $y_i$ and $h_i$ $(1 \le i \le k/2)$ to assume all
possible integral values in the interval $\langle -2N^{1/n}, +2N^{1/n}\rangle$, which can
only increase the number of solutions of our equation. This means
that every summand $f(x_i) - f(y_i)$ in equation (25) is replaced by the
expression

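$$f(y_i + h_i) - f(y_i) = \sum_{v=0}^{n-1} a_v\{(y_i + h_i)^{n-v} - y_i^{n-v}\}
 = \sum_{v=0}^{n-1} a_v\sum_{t=1}^{n-v}\binom{n-v}{t}h_i^t y_i^{n-v-t}.$$


If we change the variable $t$ of summation by putting


$$v + t = u,$$

so that

$$n - v - t = n - u, \qquad t = u - v,$$

we obtain

$$\begin{aligned}
f(y_i + h_i) - f(y_i) &= h_i\sum_{v=0}^{n-1} a_v\sum_{u=v+1}^{n}\binom{n-v}{u-v}h_i^{u-v-1}y_i^{n-u} \\
 &= h_i\sum_{u=1}^{n} y_i^{n-u}\sum_{v=0}^{u-1} a_v\binom{n-v}{u-v}h_i^{u-v-1} \\
 &= h_i\sum_{u=1}^{n} a_{i,u}\,y_i^{n-u} = h_i\varphi_i(y_i),
\end{aligned}$$

where


$$\varphi_i(y) = \sum_{u=1}^{n} a_{i,u}\,y^{n-u}$$

is a polynomial of degree $n - 1$ with coefficients


$$a_{i,u} = \sum_{v=0}^{u-1} a_v\binom{n-v}{u-v}h_i^{u-v-1} \quad (1 \le i \le k/2),$$

which depend on the numbers $h_i$.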
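
The identity $f(y + h) - f(y) = h\,\varphi(y)$ with the coefficients $a_{i,u}$ given above can be checked mechanically. In the sketch below, which is an illustration with hypothetical helper names, a polynomial is represented by its coefficient list $[a_0, a_1, \ldots, a_n]$.

```python
from math import comb


def f(coeffs, x):
    """f(x) = a_0 x^n + a_1 x^(n-1) + ... + a_n for coeffs = [a_0, ..., a_n]."""
    n = len(coeffs) - 1
    return sum(a * x ** (n - v) for v, a in enumerate(coeffs))


def phi_coeffs(coeffs, h):
    """The list [a_{i,1}, ..., a_{i,n}] for a fixed difference h."""
    n = len(coeffs) - 1
    return [
        sum(coeffs[v] * comb(n - v, u - v) * h ** (u - v - 1) for v in range(u))
        for u in range(1, n + 1)
    ]


def phi(coeffs, h, y):
    """phi(y) = sum over u = 1..n of a_{i,u} * y^(n-u)."""
    n = len(coeffs) - 1
    return sum(a * y ** (n - u) for u, a in enumerate(phi_coeffs(coeffs, h), start=1))


if __name__ == "__main__":
    cubic = [2, -1, 0, 7]  # 2x^3 - x^2 + 7
    for h in (-2, 0, 3):
        for y in (-1, 4):
            print(h, y, f(cubic, y + h) - f(cubic, y), h * phi(cubic, h, y))
```

For $f(x) = x^2$ the coefficient list comes out as $[2, h]$, i.e. $\varphi(y) = 2y + h$, in agreement with $(y + h)^2 - y^2 = h(2y + h)$.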

Thus, in the new variables $\{y_i, h_i\}$, equation (25) assumes the
form


$$h_1\varphi_1(y_1) + h_2\varphi_2(y_2) + \ldots + h_{k/2}\varphi_{k/2}(y_{k/2}) = 0. \tag{26}$$


In this equation the numbers $h_i$ and $y_i$ may take on arbitrary
integral values in the interval $\langle -2N^{1/n}, +2N^{1/n}\rangle$, where we must bear
in mind that the coefficients of the polynomials $\varphi_i(y)$ (of degree $n-1$)
depend on the numbers $h_i$.

Mark well that we have proved the following so far: The number
$r_k(m)$ which we are estimating does not exceed the sum of the
numbers of solutions in integers $y_i$, $|y_i| \le 2N^{1/n}$ $(1 \le i \le k/2)$, of all the
equations (26) which can be obtained from all possible values of the
numbers $h_i$, $|h_i| \le 2N^{1/n}$ $(1 \le i \le k/2)$.


§ 6 


CONTINUATION 


We are now going to examine one of the equations (26), i.e., we
shall regard the numbers $h_i$ $(1 \le i \le k/2)$ for a while as fixed. Let us
apply Lemma 5 to this equation; the numbers $h_i\varphi_i(y_i)$ play the role
of the unknowns $x_i$, the number $\frac{1}{2}k = 2n\cdot 2^s$ plays the role of the
number $l$, and we set $2n = k_0$ for brevity. Recall once more that the
numbers $h_i$ appear in equation (26) not only explicitly but also through
the coefficients of the polynomials $\varphi_i(y)$. The complex $A_i$, to which
the numbers $x_i = h_i\varphi_i(y_i)$ must belong, consists, in the present case,
of all numbers of the form $h_i\varphi_i(y_i)$, where the numbers $h_i$ have given,
fixed values and the numbers $y_i$ run through the interval $\langle -2N^{1/n},
+2N^{1/n}\rangle$.

According to Lemma 5, the number of solutions of equation (26)
satisfying the requirements just described does not exceed the sum
of the numbers of solutions of the equation

$$y^{(1)} + y^{(2)} + \ldots + y^{(2^{s-1})} - y^{(2^{s-1}+1)} - \ldots - y^{(2^s)} = 0 \tag{21}$$


under the following $2^s$ hypotheses, which correspond to the values
of the parameter $w = 0, 1, \ldots, 2^s - 1$:


$$y^{(j)} = y_1^{(j)} + y_2^{(j)} + \ldots + y_{k_0}^{(j)} \quad (1 \le j \le 2^s),$$


$$y_i^{(j)} \in A_{wk_0+i} \quad (1 \le i \le k_0),$$


where, remember, $A_r$ $(1 \le r \le 2^s k_0)$ is the complex of numbers of the
form $h_r\varphi_r(y_r)$ with prescribed $h_r$ and arbitrary $y_r$, $|y_r| \le 2N^{1/n}$.
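For the case $w = 0$ (which we choose merely as an example),
equation (21) in expanded form looks as follows:


$$\begin{aligned}
 &\{y_1^{(1)} + y_2^{(1)} + \ldots + y_{k_0}^{(1)}\} \\
 {}+{} &\{y_1^{(2)} + y_2^{(2)} + \ldots + y_{k_0}^{(2)}\} \\
 {}+{} &\ldots \\
 {}+{} &\{y_1^{(2^{s-1})} + y_2^{(2^{s-1})} + \ldots + y_{k_0}^{(2^{s-1})}\} \\
 {}-{} &\{y_1^{(2^{s-1}+1)} + y_2^{(2^{s-1}+1)} + \ldots + y_{k_0}^{(2^{s-1}+1)}\} \\
 {}-{} &\ldots \\
 {}-{} &\{y_1^{(2^s)} + y_2^{(2^s)} + \ldots + y_{k_0}^{(2^s)}\} = 0,
\end{aligned}$$


or, rearranging the summands,

$$\begin{aligned}
 &\{y_1^{(1)} + y_1^{(2)} + \ldots + y_1^{(2^{s-1})} - y_1^{(2^{s-1}+1)} - \ldots - y_1^{(2^s)}\} \\
 {}+{} &\{y_2^{(1)} + y_2^{(2)} + \ldots + y_2^{(2^{s-1})} - y_2^{(2^{s-1}+1)} - \ldots - y_2^{(2^s)}\} \\
 {}+{} &\ldots \\
 {}+{} &\{y_{k_0}^{(1)} + y_{k_0}^{(2)} + \ldots + y_{k_0}^{(2^{s-1})} - y_{k_0}^{(2^{s-1}+1)} - \ldots - y_{k_0}^{(2^s)}\} = 0;
\end{aligned}$$


every one of the numbers $y_i^{(j)}$ is a number of the form $h_i\varphi_i(v_i^{(j)})$,
where $|v_i^{(j)}| \le 2N^{1/n}$. Hence the last equation can be rewritten in the
form

$$\begin{aligned}
 &h_1\{\varphi_1(v_1^{(1)}) + \ldots + \varphi_1(v_1^{(2^{s-1})}) - \varphi_1(v_1^{(2^{s-1}+1)}) - \ldots - \varphi_1(v_1^{(2^s)})\} \\
 {}+{} &h_2\{\varphi_2(v_2^{(1)}) + \ldots - \varphi_2(v_2^{(2^s)})\} + \ldots + h_{k_0}\{\varphi_{k_0}(v_{k_0}^{(1)}) + \ldots - \varphi_{k_0}(v_{k_0}^{(2^s)})\} = 0.
\end{aligned}$$


By putting, for brevity,

$$\varphi_i(v_i^{(1)}) + \ldots + \varphi_i(v_i^{(2^{s-1})}) - \varphi_i(v_i^{(2^{s-1}+1)}) - \ldots - \varphi_i(v_i^{(2^s)}) = z_i \quad (1 \le i \le k_0),$$


this equation can be written quite shortly as follows:

$$h_1 z_1 + h_2 z_2 + \ldots + h_{k_0}z_{k_0} = 0. \tag{27}$$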

All together we have $2^s$ equations of this sort, and their totality can
be written down in the compact form

$$\sum_{i=1}^{k_0} h_{wk_0+i}\,z_{wk_0+i} = 0 \quad (0 \le w \le 2^s - 1).$$


For the present, however, we shall confine our investigation to
equation (27), which may of course be regarded as typical. To
estimate the number of solutions of this equation which interest us, we
must first see within what limits the quantity $\varphi_i(v_i^{(j)})$ can vary. To
this end we recall that (p. 55)


$$\varphi_i(y) = \sum_{u=1}^{n} a_{i,u}\,y^{n-u},$$

where


$$a_{i,u} = \sum_{v=0}^{u-1} a_v\binom{n-v}{u-v}h_i^{u-v-1} \quad (1 \le i \le k/2).$$

Hence it follows from our hypotheses $|a_v| \le c(n)N^{v/n}$ and $|h_i| \le 2N^{1/n}$ that


$$|a_{i,u}| \le \sum_{v=0}^{u-1} c(n)N^{v/n}\binom{n-v}{u-v}N^{(u-v-1)/n} = c(n)N^{(u-1)/n}\sum_{v=0}^{u-1}\binom{n-v}{u-v},$$

i.e., in view of $u \le n$,

$$|a_{i,u}| \le c(n)N^{(u-1)/n}. \tag{28}$$


On the other hand, because of $|v_i^{(j)}| \le 2N^{1/n}$ we have $|v_i^{(j)}|^{n-u} \le c(n)
N^{(n-u)/n}$ and consequently


$$|a_{i,u}(v_i^{(j)})^{n-u}| \le c(n)N^{(u-1)/n}\cdot N^{(n-u)/n} = c(n)N^{(n-1)/n}.$$


The same estimate (with another $c(n)$) holds for the whole $\varphi_i(v_i^{(j)})$,
since the number of terms of this polynomial is equal to $n$.
Accordingly

$$|\varphi_i(v_i^{(j)})| \le c(n)N^{(n-1)/n} \quad (1 \le i \le k_0,\ 1 \le j \le 2^s).$$

But every $z_i$ is the sum of $2^s = c(n)$ summands of the form $\pm\varphi_i(v_i^{(j)})$,
and therefore


$$|z_i| \le c(n)N^{(n-1)/n} \quad (1 \le i \le k_0)$$


(with another $c(n)$, naturally). This means that in equation (27) every
$z_i$ can assume only the values lying in the interval $\langle -c(n)N^{(n-1)/n},
+c(n)N^{(n-1)/n}\rangle$.

Let $m$ be one of these numbers. The equation $z_i = m$ can be
satisfied in general not only in one but in several ways, because the
definition of the number $z_i$ (p. 58) is such that one and the same
value of $z_i$ can very well result from different choices of the numbers
$v_i^{(j)}$ $(1 \le j \le 2^s)$. We now have to estimate the number of solutions of
the relation $z_i = m$, i.e., of the equation


$$\varphi_i(v_i^{(1)}) + \ldots + \varphi_i(v_i^{(2^{s-1})}) - \varphi_i(v_i^{(2^{s-1}+1)}) - \ldots - \varphi_i(v_i^{(2^s)}) = m. \tag{29}$$


For this purpose we shall finally have to apply the long-promised 
induction. We proceed as follows. 
First we rewrite equation (29) in the form 


$$\begin{aligned}
 &\varphi_i(v_i^{(1)}) + \varphi_i(v_i^{(2)}) + \ldots + \varphi_i(v_i^{(k')}) \\
 &\quad = m - \varphi_i(v_i^{(k'+1)}) - \ldots - \varphi_i(v_i^{(2^{s-1})}) + \varphi_i(v_i^{(2^{s-1}+1)}) + \ldots + \varphi_i(v_i^{(2^s)}).
\end{aligned}$$


This is possible because for $k' = k(n-1) > 1$ (and we have seen that
already $k(1) = 2$) we have


$$2^{s-1} = 2^{[4\log_2 k']-2} \ge k'.$$

(In detail: $k' \ge 2$, $\log_2 k' \ge 1$, $3\log_2 k' \ge 3$, $[4\log_2 k'] - 2 > 4\log_2 k' - 3 \ge
\log_2 k'$, $2^{s-1} = 2^{[4\log_2 k']-2} \ge k'$.)

If we denote the right-hand side of the last equation by $m'$, we
get


$$\varphi_i(v_i^{(1)}) + \ldots + \varphi_i(v_i^{(k')}) = m'. \tag{30}$$

Let us choose some particular values for the numbers

$$v_i^{(k'+1)}, v_i^{(k'+2)}, \ldots, v_i^{(2^s)}$$

(in the interval $\langle -2N^{1/n}, +2N^{1/n}\rangle$, naturally); then $m'$ also acquires
a definite value. To equation (30) we now apply the theorem to be
proved, since $\varphi_i(y)$ is a polynomial of degree $n-1$. We have to
verify that all the necessary hypotheses are fulfilled. We have


$$\varphi_i(y) = \sum_{u=1}^{n} a_{i,u}\,y^{n-u},$$

where, according to (28),


$$|a_{i,u}| \le c(n)N^{(u-1)/n} = c(n)\left(N^{(n-1)/n}\right)^{(u-1)/(n-1)} \tag{31}$$


and, as is easily seen,

$$|m'| \le c(n)N^{(n-1)/n}$$

(because $m$ and all $\varphi_i(v_i^{(j)})$ satisfy this inequality).


In virtue of the last inequality, the role of $N$ can be assumed by
the number $c(n)N^{(n-1)/n}$; then the conditions (31), which the
coefficients of the polynomial $\varphi_i(y)$ satisfy, are precisely the conditions
(23) with $n$ replaced by $n-1$. Thus all the hypotheses are indeed
fulfilled, and we can assert that the number of solutions of equation
(30), for which $|v_i^{(j)}| \le 2N^{1/n} = 2(N^{(n-1)/n})^{1/(n-1)}$, does not exceed the
number


$$c(n)\left(N^{(n-1)/n}\right)^{\frac{k'}{n-1}-1} = c(n)N^{(k'-n+1)/n}. \tag{32}$$


This estimate is obtained for the fixed values $v_i^{(k'+1)}, \ldots, v_i^{(2^s)}$.
Clearly we have at most


$$(4N^{1/n} + 1)^{2^s-k'} \le c(n)N^{(2^s-k')/n} \tag{33}$$

such systems of values. The total number of solutions of the
required type, of equation (29), therefore does not exceed the product
of the right-hand sides of (32) and (33), i.e., it is at most


$$c(n)N^{(2^s-n+1)/n}. \tag{34}$$
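
The exponent in (34) is just the sum of the exponents in (32) and (33). The short worked step below is added as a sketch for convenience and uses only the quantities already introduced:

```latex
\[
\frac{n-1}{n}\left(\frac{k'}{n-1} - 1\right) = \frac{k' - n + 1}{n},
\qquad
\frac{k' - n + 1}{n} + \frac{2^s - k'}{n} = \frac{2^s - n + 1}{n}.
\]
```

Note that $k'$ cancels: the multiplicity bound depends on the inductive hypothesis only through the requirement $2^{s-1} \ge k'$.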


We now return to equation (27). We saw before (p. 59) that every
$z_i$ can assume only the values lying in the interval $\langle -c(n)N^{(n-1)/n},
+c(n)N^{(n-1)/n}\rangle$. Now we see that the "multiplicity" of each of these
values (i.e., the number of ways of choosing the $v_i^{(j)}$ so as to
satisfy the equation) does not exceed the number (34).

This result makes it possible to reduce the whole problem to an
estimation of the numbers of solutions of linear equations. For at
the end of §5 we reduced the estimation of $r_k(m)$ to the estimation
of the numbers of solutions of equations of the form (26). But as we
proved by an application of Lemma 5, the number of solutions of
equation (26), for which $|y_i| \le 2N^{1/n}$, is at most equal to the sum of
the numbers of solutions of $2^s$ equations of type (27), i.e., already
linear equations. In this connection we obtained limits within which
the unknowns $z_i$ are allowed to vary. A certain new difficulty (the
price we have had to pay for the transition to linear equations) is
that the new unknowns $z_i$ have to be considered with certain
multiplicities (for which we have also determined limits).

Finally we must not forget that all these calculations are made
under the assumption that the numbers $h_i$ are chosen and fixed.
Therefore we still have to multiply the result obtained, by the
number of all such possible choices.

The final result of this section, which we have to keep in mind,
reads: Our estimated number $r_k(m)$ does not exceed the sum of the
numbers of solutions in integers $z_i$, $|z_i| \le c(n)N^{(n-1)/n}$, with
multiplicities $\lambda_i \le c(n)N^{(2^s-n+1)/n}$, of equations of the form


$$\sum_{i=1}^{k_0} h_{wk_0+i}\,z_{wk_0+i} = 0, \tag{35}$$


where $w$ runs through the values $0, 1, \ldots, 2^s - 1$, and the numbers $h_r$
$(1 \le r \le 2^s k_0)$ assume, independently of one another, all integers in
the interval $\langle -2N^{1/n}, +2N^{1/n}\rangle$.

And so we see that we have now obtained an estimate for $r_k(m)$,
in whose formulation the given polynomial $f(x)$ does not appear,
which lends this estimate a very general character.


§7 
CONCLUSION 
Now that we have reduced the problem to an estimation of the 
number of solutions of linear equations which are independent of the 
special form of the polynomial f(x), we quickly reach our goal with 
the aid of Lemma 3. 
Denote by $K$ any particular combination of the numbers $h_i$,
$|h_i| \le 2N^{1/n}$ $(1 \le i \le k/2)$, and by $U_w(K)$ the number of solutions of
equation (35) for this fixed combination $K$ and for a certain
prescribed $w$, where we are concerned with those solutions $z_i$ which
satisfy the inequalities $|z_i| \le c(n)N^{(n-1)/n}$, with multiplicities $\lambda_i \le
c(n)N^{(2^s-n+1)/n}$. Then, according to the final result in the preceding
section,

$$r_k(m) \le \sum_{K}\sum_{w=0}^{2^s-1} U_w(K),$$

where the summation over $K$ extends over all admissible
combinations $K$ of the numbers $h_i$. This can be written


$$r_k(m) \le \sum_{w=0}^{2^s-1}\Big\{\sum_{K} U_w(K)\Big\}.$$


It is immediately evident, however, that for different $w$ the sums
$\sum_K U_w(K)$ do not differ from one another at all (because for different
$w$ the equations (35) do not differ from one another in any respect).
We can therefore write


$$r_k(m) \le 2^s\sum_{K} U_0(K) = c(n)\sum_{K} U_0(K).$$


Here $U_0(K)$ is the number of solutions of the equation

$$\sum_{i=1}^{k_0} h_i z_i = 0 \tag{36}$$

for the given combination $K$ of the numbers $h_i$, $|h_i| \le 2N^{1/n}$ $(1 \le i \le k/2)$,
where $|z_i| \le c(n)N^{(n-1)/n}$ and the $z_i$ have multiplicities $\lambda_i \le c(n)
N^{(2^s-n+1)/n}$. Let us denote by $U_0^*(K)$ the number of solutions of the
same equation under the assumption that all $z_i$ are simple. Then
clearly


$$U_0(K) \le \{c(n)N^{(2^s-n+1)/n}\}^{k_0}\,U_0^*(K),$$


or, recalling that $k_0 = 2n$,

$$U_0(K) \le c(n)N^{2(2^s-n+1)}\,U_0^*(K),$$

and hence

$$r_k(m) \le c(n)N^{2(2^s-n+1)}\sum_{K} U_0^*(K). \tag{37}$$


Now let us note the following. Every $K$ represents a certain
admissible combination of the values of all $h_i$ $(1 \le i \le k/2)$; the number
$U_0^*(K)$, however, is completely determined by the first
$k_0 = 2n$ of these values $(1 \le i \le 2n)$, because they alone appear in
equation (36). Of course when we choose a certain fixed combination
$K$, we thereby also uniquely define a certain combination $K'$ of the
values $h_1, h_2, \ldots, h_{2n}$. But if, conversely, a certain combination $K'$
of the numbers $h_1, h_2, \ldots, h_{2n}$ is selected, there corresponds to it
not the single combination $K$, but rather as many as there are ways
of choosing the remaining "supplements" $h_i$ $(2n < i \le k/2)$. Since
every $h_i$ must belong to the interval $\langle -2N^{1/n}, +2N^{1/n}\rangle$, it is
evident that to a combination $K'$ there correspond at most

$$c(n)(N^{1/n})^{(k/2)-2n} = c(n)N^{(k/2n)-2}$$

combinations $K$. Hence


$$\sum_{K} U_0^*(K) \le c(n)N^{(k/2n)-2}\sum_{K'} U^*(K'),$$


where $U^*(K')$ is the number of solutions in integers $z_i$, $|z_i| \le c(n)
N^{(n-1)/n}$ $(1 \le i \le 2n)$, of equation (36) for the given combination $K'$ of
the numbers $h_i$, $|h_i| \le 2N^{1/n}$ $(1 \le i \le 2n)$, and the summation is to be
extended over all such combinations. From (37) we therefore obtain*


$$r_k(m) \le c(n)N^{2(2^s-n+1)+(k/2n)-2}\sum_{K'} U^*(K') = c(n)N^{2(2^{s+1}-n)}\sum_{K'} U^*(K'). \tag{38}$$


Finally, $\sum_{K'} U^*(K')$ is immediately estimated with the help of
Lemma 3; here we have to put $l = 2n$, $A = 2N^{1/n}$, $B = c(n)N^{(n-1)/n}$;
you can easily verify that all the hypotheses of Lemma 3 are
satisfied. On applying this lemma we find that


$$\sum_{K'} U^*(K') \le c(n)(AB)^{2n-1} = c(n)N^{2n-1}.$$


At last, inequality (38) yields


$$r_k(m) \le c(n)N^{2(2^{s+1}-n)}\cdot N^{2n-1} = c(n)N^{2^{s+2}-1} = c(n)N^{(k/n)-1},$$

which completes the proof of the fundamental lemma and thereby
also of Hilbert's theorem.
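
The exponent bookkeeping of this section can be double-checked mechanically. The sketch below is an illustration with function names of my own; it adds up the three exponents of $N$ that enter the final estimate (the multiplicity factor from (37), the count of "supplements", and the contribution of Lemma 3) and compares the total with $(k/n) - 1$, using $k = 2n\cdot 2^{s+1}$ from (24).

```python
from fractions import Fraction


def final_exponent(n: int, s: int) -> Fraction:
    """Total exponent of N collected in the last chain of inequalities."""
    k = 2 * n * 2 ** (s + 1)                  # relation (24)
    k0 = 2 * n
    mult = Fraction(2 ** s - n + 1, n) * k0   # multiplicity factor, as in (37)
    supplements = Fraction(k, 2 * n) - 2      # choices of the remaining h_i
    lemma3 = 2 * n - 1                        # (AB)^(2n-1), with AB of order N
    return mult + supplements + lemma3


def target_exponent(n: int, s: int) -> Fraction:
    """The exponent (k/n) - 1 claimed in the fundamental lemma."""
    k = 2 * n * 2 ** (s + 1)
    return Fraction(k, n) - 1


if __name__ == "__main__":
    for n in (2, 3, 4):
        for s in (1, 3, 23):
            print(n, s, final_exponent(n, s), target_exponent(n, s))
```

Both functions reduce to $2^{s+2} - 1$; the check treats $n$ and $s$ as free parameters, so it confirms the algebra only and says nothing about the constants $c(n)$.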

This proof, so exquisitely elementary, will undoubtedly seem
very complicated to you. But it will take you only two to three
weeks' work with pencil and paper to understand and digest it
completely. It is by conquering difficulties of just this sort, that the
mathematician grows and develops.

*Recall that $k = 2n\cdot 2^{s+1}$.


